rna-tools.online

rna.tools.online

The rna-tools.online server hosts many bioinformatic tools to perform various operations on different types of RNA data with ease. This project has been developed to give users access to a comfortable interface that doesn’t require any knowledge of programming or command line execution.

Feel free to contribute to improve this documentation at Google Docs.

Motivation

The syntax for selection

Re-run jobs

Form validation

Tools

RNA 3D structure conversion from CIF to PDB

Conversion between mmCIF and PDB

RNA 3D structure analysis

Get sequences

Get secondary structures

Contacts (Interactions) classification with ClaRNA

Analysis with X3DNA

RNA 3D structure standardization

RNA 3D structure editing

RNA 3D structure minimization

RNA 3D structures comparison

RNA 3D model quality assessment

Demo files

Feedback is welcome!

Motivation

Significant improvements have been made in the efficiency and accuracy of RNA 3D structure prediction methods in recent years; however, many tools developed in the field stay exclusive to only a few bioinformatic groups. To perform a complete RNA 3D structure modeling analysis as proposed in the RNA-Puzzles publications, e.g., [1], the researchers must familiarize themselves with a quite complex set of tools.

The goal of the rna-tools package [2] was to provide a more abstract way to process data for RNA 3D modeling. Nowadays, rna-tools has become a wide toolbox to approach every aspect of working with various types of RNA data. The package was used to provide computational resources for the RNA-puzzle community [2] and also offered tools for other biological applications, e.g., [3,4,5].

However, using rna-tools requires the installation of a mixture of library and tools and basic knowledge of the Linux terminal command line. To give a chance for all biologists to take advantage of developments in RNA 3D structure prediction, we provide a user-friendly server to perform many standard analyses required for the typical modeling workflow: secondary structure prediction, 3D structure manipulating and editing, structure minimization, structures analysis, and comparison tools.

In the server, each tool has been translated into a web application. The user can use the web browser, select the tool to use, and upload its own files. Once the computation is done, the webserver allows the user to download the results and explore the steps that were used to perform the analysis. This will also provide a way to learn how the rna-tool package can be used and will spur the user into trying to perform more customized analyses.

All tools are well documented and examples are provided to help the users to understand the tools.

[1] Z. Miao, R. W. Adamiak, M. Antczak, M. J. Boniecki, J. M. Bujnicki, S.-J. Chen, C. Y. Cheng, Y. Cheng, F.-C. Chou, R. Das, N. V. Dokholyan, F. Ding, C. Geniesse, Y. Jiang, A. Joshi, A. Krokhotin, M. Magnus, O. Mailhot, F. Major, T. H. Mann, P. Piątkowski, R. Pluta, M. Popenda, J. Sarzynska, L. Sun, M. Szachniuk, S. Tian, J. Wang, J. Wang, A. M. Watkins, J. Wiedemann, Y. Xiao, X. Xu, J. D. Yesselman, D. Zhang, Y. Zhang, Z. Zhang, C. Zhao, P. Zhao, Y. Zhou, T. Zok, A. Zyła, A. Ren, R. T. Batey, B. L. Golden, L. Huang, D. M. Lilley, Y. Liu, D. J. Patel, and E. Westhof, “RNA-Puzzles Round IV: 3D structure predictions of four ribozymes and two aptamers.,” RNA, vol. 26, no. 8, pp. 982–995, Aug. 2020.

[2] M. Magnus, M. Antczak, T. Zok, J. Wiedemann, P. Lukasiak, Y. Cao, J. M. Bujnicki, E. Westhof, M. Szachniuk, and Z. Miao, “RNA-Puzzles toolkit: a computational resource of RNA 3D structure benchmark datasets, structure manipulation, and evaluation tools.,” Nucleic Acids Research, vol. 48, no. 2, pp. 576–588, Jan. 2020.

[3] M. Magnus, K. Kappel, R. Das, and J. M. Bujnicki, “RNA 3D structure prediction guided by independent folding of homologous sequences.,” BMC Bioinformatics, vol. 20, no. 1, pp. 512–15, Oct. 2019.

[4] K. Eysmont, K. Matylla-Kulinska, A. Jaskulska, M. Magnus, and M. M. Konarska, “Rearrangements within the U6 snRNA Core during the Transition between the Two Catalytic Steps of Splicing.,” Molecular Cell, vol. 75, no. 3, pp. 538–548.e3, Aug. 2019.

[5] F. Stefaniak and J. M. Bujnicki, “AnnapuRNA: A scoring function for predicting RNA-small molecule binding poses,” PLoS Comput Biol, vol. 17, no. 2, p. e1008309, Feb. 2021

Documentation

The Tools page is divided into seven categories, each one describing the type of tools it contains. The tools are listed one below another, and a small description follows the name of each tool. By clicking on the hyperlink it is possible to reach the page of every single tool. The sections in which the tools have been divided are the following:

RNA 3D structure conversion from CIF to PDB
RNA 3D structure analysis
RNA 3D structure standardization
RNA 3D structure editing
RNA 3D structure minimization
RNA 3D structures comparison
RNA 3D model quality assessment

The tools may differ from one another, but the general pipeline is the following:

The user opens the tool page for the required actions;
Then, following the instruction displayed on the page, the user can drag and drop input files into a box on the page or “ Fetch ” files from another job, and press “ Run! ”
The tool will start to work in the background and after a few seconds or minutes (depending on the tool) , the result files will be generated.
The user will then be able to retrieve the data of the analysis and re-use it to perform other tasks on the rna-tools web server.

Demo

Each Tool has an option to “Load demo”. If the user is unsure about the file or the correct formats that the tool accepts, “run” the demo and download and explore the loaded example file and the resulting output.

The tool-specific documentation for each tool can be found on each page, at the bottom.

Job id

If you reload the page when the job id (identifier) is not in your URL then a new folder will be created for the user:

http://rna-tools.online/tools/calc-rmsd/

To access the same folder and the same job, in a new browser window or after refreshing your page, you need a full URL:

http://rna-tools.online/tools/calc-rmsd/11e4e814

Fetch

The user can use “Fetch” with the job ID of another job to get files into a current job.

Syntax for selection

Various tools use the selection scheme.

For “Mutate”, “A:1A+2A+3A+4A,B:13A” defines mutating all selected residues into adenine (“A”). “A:1A” means taking the first residue from chain A and mutating it into A (adenine). The user can combine resides from the same chain with “+” and add residues to be mutated in another change by using “,”.

For “Calculate RMSD”, ranges of residues can be defined using “-”, e.g., “A:1-17+24-110+115-168”, meaning, select residues from 1 to 17 (including), and from 24 to 110 (including) and from 115 to 168 (including) of chain A.

Moreover for “Calculate RMSD” negative selection is possible to remove a single atom from selection, e.g. “A/57/O2\'’, meaning remove “O2’” of residues 57 of chain A (\ is required to protect ‘ from being interpreted as an end of a string).

Re-run jobs

The goal of the server was to implement an interactive workflow to allow the user for complex analyses. The user can work in the same server folder by removing some input files and keeping outputs of the tools, and by adding new files, one can perform interactively more complex analyses. Even a finished job can be easily re-run (for example, after removing one of the input files or adding new files) to get a new result.

Form validation

Some forms need extra information, when missing, the following information will be displayed and the submission will be stopped.

Figure. Form validation in enabled for some tools when the input is required.

In some tools, input verification is challenging without running the tool. In these cases, such as "Calculate RMSD", the tool reports the problem in the output of a given tool. Here, because of the different lengths of segments taken for the calculations (chain A, residues 1-17 (including) vs chain A, residues 1-18 (including)), the number of atoms is different and RMSD can not be calculated. The information about the issues is shown in the output.

Figure. Errors can be reported in the output of a given tool.

Tools

RNA 3D structure conversion from CIF to PDB

Conversion between mmCIF and PDB

As only a limited number of chains and atoms can be deposited in the PDB format, the mmCIF format has been introduced to provide an alternative way to save structures. As the predicted RNA structures are normally within the capability of the PDB format, this format is still used in the RNA-Puzzles community. Moreover, many tools available in the field of RNA 3D bioinformatics still are using the PDB format and likely will not be updated for the mmCIF format. Thus, we decided to provide a web application to convert mmCIF format to PDB format (“Convert CIF files to PDB”) and reverse (“Convert PDB files to CIF”). These two tools are based on the open-source version of PyMOL.

Convert CIF files to PDB

Convert PDB files to CIF

RNA 3D structure analysis

The first group of tools includes programs that aim to facilitate the analysis of RNA 3D structure. With “Get sequences” the user can easily obtain RNA sequences for the uploaded PDB files. To obtain secondary structures from the PDB files, the tool “Get secondary structures” can be used that is based internally on 3DNA /DSSR software. The 3DNA/DSSR software is also used for the next tool, “Analysis with X3DNA” which provides various detailed statistics for PDB files such as a list of RNA elements (helixes, stems, motifs, nucleotide modifications) and configuration of base pairs. The last tool in this group uses ClaRNA to classify the contacts (interactions) between base pairs in a PDB file.

Get sequences get sequences of a bunch of PDB files

Get secondary structures get secondary structures of a bunch of PDB files

Analysis with X3DNA get statistics and details on PDB files

Analysis with ClaRNA get interactions detected for PDB files

Get sequences

There are two ways how to obtain the sequences from PDB files. As the demo, we provide four models from the RNA-Puzzle 21 target.

The default options will show the filename after ‘#’ and the sequence of all chains (in here, only one chain is present per model):

# 21_3dRNA_1_rpr

>A:1-41