It is intended to provide information on the sequences and functions of transcripts which do not code for proteins, but perform regulatory roles in the cell. Currently, the database includes over 30,000 individual sequences from 99 species of Bacteria, Archaea and Eukaryota. The primary source of sequences included in the database was the GenBank. Additional annotation information for mouse and human ncRNAs was derived from FANTOM3 database and H-inviational Integrated Database of Annotated Human Genes version 3.4, respectively. Genome mapping information was derived from tha data available at the UCSC Genome Browser site. The sequences and annotations of small cytoplasmic RNAs from bacteria, for which annotation is lacking in the genome sequences, were derived from the Rfam database. The microRNAs or snoRNAs which were available in previous editions, as well as other housekeeping (infrastructural) RNAs (e.g. rRNA, tRNA, snRNA, SRP RNA) are not included in our database to avoid redundancy with more specialized databases which emerged in recent years.
fRNAdb is a novel database service that hosts a large collection of non-coding transcripts including annotated/non-annotated sequences from the H-inv database, NONCODE and RNAdb. A set of computational analyses have been performed on the included sequences. These analyses include RNA secondary structure motif discovery, EST support evaluation, cis-regulatory element search, protein homology search, etc.
Database compiles all complete or nearly complete SSU (small subunit) and LSU (large subunit) ribosomal RNA sequences. Sequences are provided in aligned format. Alignment takes into account secondary structure information derived by comparative sequence analysis of thousands of sequences. Additional information such as literature references, taxonomy, secondary structure modles and nucleotide variability maps, is also available.
The Subviral RNA database facilitates the research and analysis of viroids, satellite RNAs, satellite viruses, the human hepatitis delta virus, and related RNA sequences. It integrates a large number of Subviral RNA sequences, their respective RNA motifs, analysis tools, related publication links and additional pertinent information to allow users to efficiently retrieve and analyze relevant information about these small RNA agents. The Subviral RNA Database contains 2877 sequences indexed in 83 species and 4 main groups.
https://lter.kbs.msu.edu/data/terms-of-use/https://lter.kbs.msu.edu/data/terms-of-use/
The Ribosomal RNA Database is curated by the Schmidt Laboratory and...
Created to collate literature evidence that support gene or protein association with the stress granules (SGs) and P-bodies (PBs), with the goal of consolidating high-confidence proteomes of SG and PB. Protein database for mammalian SGs and processing bodies (PBs), which are related condensates.
tRNAdb 2009 provides a powerful and fast search engine. Taxons can be identified by browsing the taxonomic tree or by using the search form. Queries can include DNA or RNA sequences, amino acid family, anticodon, references, Pubmed-ID of the reference, gene ID as well as comments. In addition, individual searches concerning sequence or structure characteristics are possible. The server accepts IDs of the new as well as the old tRNA database as queries and can perform BLAST searches. All sequences can be downloaded in several file- and alignment formats on the result list. This site is hosted and maintained in a cooperation between the universities of Leipzig (Germany), Marburg (Germany) and Strasbourg (France). Recent Visitors
CSRDB is a bioinformatics resource for cereal crops consisting of large-scale datasets of maize and rice and small RNA sequences. The sequences were generated by 454 Life Science sequencing. The small RNA sequences have been mapped to the rice genome and available maize genome sequence and are presented in two genome browser datasets using the Generic Genome Browser. Potential target sequences representing mature mRNA sequences have been predicted using the FASTH software from the Zuker lab. and access to the resulting small RNA target pair (SRTP) dataset has been made available through a mysql based relational database. Within the genome browser the small RNAs have links to the SRTP database that will return a list of potential targets. The SRTP database may also be searched independently using both small RNA and target transcript queries. Data linking and integration is the main focus of this interface and to this aim links are present in the SRTP results pages back to the browser and the SRTP database as well as external sites.
This is a public resource highlighting efforts at ARS in developing small RNA genome information for the potato genome. Updates and progress are reported here. Resources in this dataset:Resource Title: Web Page. File Name: Web Page, url: https://potato.pw.usda.gov
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Introduction The domestic-animal lncRNA database (ALDB) is the first comprehensive database with a focus on the domestic-animal lncRNAs. ALDB currently comprises 12,103 pig lincRNAs, 8,923 chicken lincRNAs, and 8,250 cow lincRNAs, which we have identified using computational pipeline in this study. Moreover, ALDB provides related useful data, such as genome-wide expression profile and animal quantitative trait loci (QTLs), that is not available in the existing lncRNA database (lncRNAdb and NONCODE), along with convenient tools, such as BLAST, GBrowse and flexible search functionalities.
This published compressed file(.zip) includes the following files: Pig (Sus scrofa) lincRNAs Pig lincRNAs (on SS_10.2 in GFF3 format) Pig lincRNAs (on SS_10.2 in GTF format) Pig lincRNAs (in fasta format) Pig lincRNAs (on SS_10.2 in GTF format) [Identified by Zhou et al. ] Chicken (Gallus gallus) lincRNAs Chicken lincRNAs (on GG_4.0 in GFF3 format) Chicken lincRNAs (on GG_4.0 in GTF format) Chicken lincRNAs (in fasta format) Cow (Bos taurus) lincRNAs Cow lincRNAs (on UMD_3.1 in GFF3 format) Cow lincRNAs (on UMD_3.1 in GTF format) Cow lincRNAs (in fasta format) Pig (Sus scrofa) expression Pig FPKM expression levels, derived from ERA178851 using TopHat and Cufflinks Pig count expression levels, derived from ERA178851 using SummaryOverlaps Chicken (Gallus gallus) expression Chicken FPKM expression levels, derived from SRA059960 using TopHat and Cufflinks Chicken count expression levels, derived from SRA059960 using SummaryOverlaps Cow (Bos taurus) expression Cow FPKM expression levels, derived from SRA059960 using TopHat and Cufflinks Cow count expression levels, derived from SRA059960 using SummaryOverlaps CNCI of lincRNA transcripts ALDB_pig_lincRNAs_with_CNCI ALDB_chicken_lincRNAs_with_CNCI ALDB_cow_lincRNAs_with_CNCI
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Searching for similar sequences in a database via BLAST or a similar tool is one of the most common bioinformatics tasks applied in general, and to non-coding RNAs in particular. However, the results of the search might be difficult to interpret due to the presence of partial matches to the database subject sequences. Here, we present rboAnalyzer – a tool that helps with interpreting sequence search result by (1) extending partial matches into plausible full-length subject sequences, (2) predicting homology of RNAs represented by full-length subject sequences to the query RNA, (3) pooling information across homologous RNAs found in the search results and public databases such as Rfam to predict more reliable secondary structures for all matches, and (4) contextualizing the matches by providing the prediction results and other relevant information in a rich graphical output. Using predicted full-length matches improves secondary structure prediction and makes rboAnalyzer robust with regards to identification of homology. The output of the tool should help the user to reliably characterize non-coding RNAs in BLAST output. The usefulness of the rboAnalyzer and its ability to correctly extend partial matches to full-length is demonstrated on known homologous RNAs. To allow the user to use custom databases and search options, rboAnalyzer accepts any search results as a text file in the BLAST format. The main output is an interactive HTML page displaying the computed characteristics and other context of the matches. The output can also be exported in an appropriate sequence and/or secondary structure formats.
https://www.arb-silva.de/silva-license-information/https://www.arb-silva.de/silva-license-information/
The SILVA database project provides comprehensive, quality checked and regularly updated databases of aligned small (16S / 18S, SSU) and large subunit (23S / 28S, LSU) ribosomal RNA (rRNA) sequences for all three domains of life (Bacteria, Archaea and Eukarya).
A 16S rRNA gene database which provides chimera screening, standard alignment, and taxonomic classification using multiple published taxonomies.
A database on nucleotide sequences of 5S rRNAs and their genes. The database contains 1985 primary structures of 5S rRNA and 5S rDNA, and was last updated in 2002, according to the website. They include 60 archaebacterial, 470 eubacterial, 63 plastid, nine mitochondrial and 1383 eukaryotic sequences. The nucleotide sequences of the 5S rRNAs or 5S rDNAs are divided according to the taxonomic position of the source organisms. The sequences for particular organisms can be retrieved as single files using a taxonomic browser or in multiple sequence structural alignments. The multiple sequence alignments of 5S ribosomal RNAs can be downloaded in TAB-delimited and FASTA formats.
The Rfam database is a collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (CMs). The families in Rfam break down into three broad functional classes: non-coding RNA genes, structured cis-regulatory elements and self-splicing RNAs. Typically these functional RNAs often have a conserved secondary structure which may be better preserved than the RNA sequence. The CMs used to describe each family are a slightly more complicated relative of the profile hidden Markov models (HMMs) used by Pfam. CMs can simultaneously model RNA sequence and the structure in an elegant and accurate fashion.
GRSDB is a database of G-quadruplexes and contains information on composition and distribution of putative Quadruplex-forming G-Rich Sequences (QGRS) mapped in the eukaryotic pre-mRNA sequences, including those that are alternatively processed (alternatively spliced or alternatively polyadenylated). The data stored in the GRSDB is based on computational analysis of NCBI Entrez Gene entries and their corresponding annotated genomic nucleotide sequences of RefSeq/GenBank.
BPS is a database of RNA base pairs with quantitative information on the spatial arrangements of interacting bases, including higher-order base associations, and the context of these interactions in high-resolution crystal structures. The structures are taken from the Nucleic Acid Database (NDB), and the base pairs are identified and characterized with the 3DNA software package. The interactions are classified in terms of residue identities, base-pair positioning, and hydrogen-bonding patterns and related to the structural context in which they occur. A user can browse the atlas of base-pair patterns and carry out searches for patterns of specific types or from specific structures.
It is a database that details the interactions of extruded, unpaired RNA nucleotide bases. It presents and classifies the protein binding pockets that accommodate them, and also allows the recognition of similar protein binding patters involved in interactions with different RNA molecules. Given an unbound structure of a target protein, it allows the prediction of its RNA nucleotide binding sites. The goal of this database is to describe, classify, and predict the interactions between protein binding sites and single-stranded RNA bases. Specifically, RsiteDB describes the protein binding pockets that accommodate extruded nucleotides not involved in RNA base pairing. RsiteDB has two modes of operation. Analysis and classification of protein-RNA interactions: Given a protein-RNA complex RsiteDB analyzes its nucleotide and dinucleotide binding sites. It details the properties of the protein binding pockets that accommodate these extruded nucleotides and presents a list of proteins with similar binding pockets. These proteins may have a totally different overall sequences and structural folds. RsiteDB details and visualizes the features shared by all the binding sites classified to the same cluster. Prediction of RNA dinucleotide binding sites: Given a target, potentially unbound, protein structure we search its surface for regions similar to the created 3-D consensus binding patterns of RNA dinucleotides. The recognized regions are predicted to serve as binding sites. Using leave-one-out tests, the success rate of these predictions was estimated to be about 80%. It must be noted that currently we do not aim to predict whether a protein can bind RNA; rather, given an unbound RNA binding protein, our goal is to predict its binding sites and their modes of interaction. In addition, due to a low number of single nucleotide clusters, currently, we do not use them for the prediction.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundLong noncoding RNAs (lncRNAs) have attracted significant attention in recent years due to their important roles in many biological processes. Domestic animals constitute a unique resource for understanding the genetic basis of phenotypic variation and are ideal models relevant to diverse areas of biomedical research. With improving sequencing technologies, numerous domestic-animal lncRNAs are now available. Thus, there is an immediate need for a database resource that can assist researchers to store, organize, analyze and visualize domestic-animal lncRNAs.ResultsThe domestic-animal lncRNA database, named ALDB, is the first comprehensive database with a focus on the domestic-animal lncRNAs. It currently archives 12,103 pig intergenic lncRNAs (lincRNAs), 8,923 chicken lincRNAs and 8,250 cow lincRNAs. In addition to the annotations of lincRNAs, it offers related data that is not available yet in existing lncRNA databases (lncRNAdb and NONCODE), such as genome-wide expression profiles and animal quantitative trait loci (QTLs) of domestic animals. Moreover, a collection of interfaces and applications, such as the Basic Local Alignment Search Tool (BLAST), the Generic Genome Browser (GBrowse) and flexible search functionalities, are available to help users effectively explore, analyze and download data related to domestic-animal lncRNAs.ConclusionsALDB enables the exploration and comparative analysis of lncRNAs in domestic animals. A user-friendly web interface, integrated information and tools make it valuable to researchers in their studies. ALDB is freely available from http://res.xaut.edu.cn/aldb/index.jsp.
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on July 31,2025. A scientific community-crowdsourced database containing the RNA secondary structures of known types and organisms. It is meant to provide a simple and powerful way to analyze, search and update a shared repository of information.
It is intended to provide information on the sequences and functions of transcripts which do not code for proteins, but perform regulatory roles in the cell. Currently, the database includes over 30,000 individual sequences from 99 species of Bacteria, Archaea and Eukaryota. The primary source of sequences included in the database was the GenBank. Additional annotation information for mouse and human ncRNAs was derived from FANTOM3 database and H-inviational Integrated Database of Annotated Human Genes version 3.4, respectively. Genome mapping information was derived from tha data available at the UCSC Genome Browser site. The sequences and annotations of small cytoplasmic RNAs from bacteria, for which annotation is lacking in the genome sequences, were derived from the Rfam database. The microRNAs or snoRNAs which were available in previous editions, as well as other housekeeping (infrastructural) RNAs (e.g. rRNA, tRNA, snRNA, SRP RNA) are not included in our database to avoid redundancy with more specialized databases which emerged in recent years.