100+ datasets found

e
Q13316
ebi.ac.uk
Updated Apr 17, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2013). Q13316 [Dataset]. https://www.ebi.ac.uk/interpro/protein/reviewed/Q13316
Explore at:
Dataset updated
Apr 17, 2013
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
May have a dual function during osteoblast differentiation. In the nucleus of undifferentiated osteoblasts, unphosphorylated form acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the osteoblast to osteocyte transition phase it is phosphorylated and exported into the extracellular matrix, where it regulates nucleation of hydroxyapatite
INSDC Sequences
gbif.org
Updated Mar 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
European Bioinformatics Institute (EMBL-EBI); European Bioinformatics Institute (EMBL-EBI) (2025). INSDC Sequences [Dataset]. http://doi.org/10.15468/sbmztx
Explore at:
Unique identifier
https://doi.org/10.15468/sbmztx
Dataset updated
Mar 15, 2025
Dataset provided by
Global Biodiversity Information Facilityhttps://www.gbif.org/
European Bioinformatics Institutehttp://www.ebi.ac.uk/
Authors
European Bioinformatics Institute (EMBL-EBI); European Bioinformatics Institute (EMBL-EBI)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Description
This dataset contains INSDC sequence records not associated with environmental sample identifiers or host organisms. The dataset is prepared periodically using the public ENA API (https://www.ebi.ac.uk/ena/portal/api/) by querying data with search parameters: `environmental_sample=False & host=""`
EMBL-EBI also publishes other records in separate datasets (https://www.gbif.org/publisher/ada9d123-ddb4-467d-8891-806ea8d94230).
The data was then processed as follows:
1. Human sequences were excluded.
2. For non-CONTIG records, the sample accession number (when available) along with the scientific name were used to identify sequence records corresponding to the same individuals (or group of organism of the same species in the same sample). Only one record was kept for each scientific name/sample accession number.
3. Contigs and whole genome shotgun (WGS) records were added individually.
4. The records that were missing some information were excluded. Only records associated with a specimen voucher or records containing both a location AND a date were kept.
5. The records associated with the same vouchers are aggregated together.
6. A lot of records left corresponded to individual sequences or reads corresponding to the same organisms. In practise, these were "duplicate" occurrence records that weren't filtered out in STEP 2 because the sample accession sample was missing. To identify those potential duplicates, we grouped all the remaining records by `scientific_name`, `collection_date`, `location`, `country`, `identified_by`, `collected_by` and `sample_accession` (when available). Then we excluded the groups that contained more than 50 records. The rationale behind the choice of threshold is explained here: https://github.com/gbif/embl-adapter/issues/10#issuecomment-855757978
7. To improve the matching of the EBI scientific name to the GBIF backbone taxonomy, we incorporated the ENA taxonomic information. The kingdom, Phylum, Class, Order, Family, and genus were obtained from the ENA taxonomy checklist available here: http://ftp.ebi.ac.uk/pub/databases/ena/taxonomy/sdwca.zip
More information available here: https://github.com/gbif/embl-adapter#readme
You can find the mapping used to format the EMBL data to Darwin Core Archive here: https://github.com/gbif/embl-adapter/blob/master/DATAMAPPING.md
e
Data from: Competitive binding of STATs to receptor phospho-Tyr motifs...
ebi.ac.uk
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stephan Wilmes, Competitive binding of STATs to receptor phospho-Tyr motifs accounts for altered cytokine responses in autoimmune disorders [Dataset]. https://www.ebi.ac.uk/pride/archive/projects/PXD024188
Explore at:
Authors
Stephan Wilmes
Variables measured
Proteomics
Description
In this study, we compared the effects of two cytokine treatments on the proteome of human Th-1 cells. We used saturating doses of murine single-chain IL-27 (EBI3+p28, 10nM) and HyperIL-6 (20nM) and continuously stimulated cells of three donors with the two cytokines for 24h or left untreated.
ChEMBL EBI Small Molecules Database
kaggle.com
zip
Updated Feb 12, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Google BigQuery (2019). ChEMBL EBI Small Molecules Database [Dataset]. https://www.kaggle.com/bigquery/ebi-chembl
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Feb 12, 2019
Dataset provided by
BigQueryhttps://cloud.google.com/bigquery
Authors
Google BigQuery
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Context

ChEMBL is maintained by the European Bioinformatics Institute (EBI), of the European Molecular Biology Laboratory (EMBL), based at the Wellcome Trust Genome Campus, Hinxton, UK.

Content

ChEMBL is a manually curated database of bioactive molecules with drug-like properties used in drug discovery, including information about existing patented drugs.

Schema: http://ftp.ebi.ac.uk/pub/databases/chembl/ChEMBLdb/releases/chembl_23/chembl_23_schema.png

Documentation: http://ftp.ebi.ac.uk/pub/databases/chembl/ChEMBLdb/releases/chembl_23/schema_documentation.html

Fork this notebook to get started on accessing data in the BigQuery dataset using the BQhelper package to write SQL queries.

Acknowledgements

“ChEMBL” by the European Bioinformatics Institute (EMBL-EBI), used under CC BY-SA 3.0. Modifications have been made to add normalized publication numbers.

Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:ebi_chembl

Banner photo by rawpixel on Unsplash
INSDC Host Organism Sequences
gbif.org
Updated Mar 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
European Bioinformatics Institute (EMBL-EBI); European Bioinformatics Institute (EMBL-EBI) (2025). INSDC Host Organism Sequences [Dataset]. http://doi.org/10.15468/e97kmy
Explore at:
Unique identifier
https://doi.org/10.15468/e97kmy
Dataset updated
Mar 22, 2025
Dataset provided by
Global Biodiversity Information Facilityhttps://www.gbif.org/
European Bioinformatics Institutehttp://www.ebi.ac.uk/
Authors
European Bioinformatics Institute (EMBL-EBI); European Bioinformatics Institute (EMBL-EBI)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered

Description
This dataset contains INSDC sequences associated with host organisms. The dataset is prepared periodically using the public ENA API (https://www.ebi.ac.uk/ena/portal/api/) using the methods described below.
EMBL-EBI also publishes other records in separate datasets (https://www.gbif.org/publisher/ada9d123-ddb4-467d-8891-806ea8d94230).
The data was then processed as follows:
1. Human sequences were excluded.
2. For non-CONTIG records, the sample accession number (when available) along with the scientific name were used to identify sequence records corresponding to the same individuals (or group of organism of the same species in the same sample). Only one record was kept for each scientific name/sample accession number.
3. Contigs and whole genome shotgun (WGS) records were added individually.
4. The records that were missing some information were excluded. Only records associated with a specimen voucher or records containing both a location AND a date were kept.
5. The records associated with the same vouchers are aggregated together.
6. A lot of records left corresponded to individual sequences or reads corresponding to the same organisms. In practise, these were "duplicate" occurrence records that weren't filtered out in STEP 2 because the sample accession sample was missing. To identify those potential duplicates, we grouped all the remaining records by `scientific_name`, `collection_date`, `location`, `country`, `identified_by`, `collected_by` and `sample_accession` (when available). Then we excluded the groups that contained more than 50 records. The rationale behind the choice of threshold is explained here: https://github.com/gbif/embl-adapter/issues/10#issuecomment-855757978
7. To improve the matching of the EBI scientific name to the GBIF backbone taxonomy, we incorporated the ENA taxonomic information. The kingdom, Phylum, Class, Order, Family, and genus were obtained from the ENA taxonomy checklist available here: http://ftp.ebi.ac.uk/pub/databases/ena/taxonomy/sdwca.zip
More information available here: https://github.com/gbif/embl-adapter#readme
You can find the mapping used to format the EMBL data to Darwin Core Archive here: https://github.com/gbif/embl-adapter/blob/master/DATAMAPPING.md
E
Blueprint: A human variation panel of genetic influences on epigenomes and...
ega-archive.org
Updated Sep 26, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). Blueprint: A human variation panel of genetic influences on epigenomes and transcriptomes in three immune cell types, (ChIP-Seq for CD4-positive, alpha-beta T cell, on genome GRCh37) [Dataset]. https://ega-archive.org/datasets/EGAD00001002673
Explore at:
Dataset updated
Sep 26, 2016
License
https://ega-archive.org/dacs/EGAC00001000135https://ega-archive.org/dacs/EGAC00001000135
Description
ChIP-Seq data for 154 CD4-positive, alpha-beta T cell sample(s). 355 run(s), 265 experiment(s), 250 analysis(s) on human genome GRCh37. Analysis documentation available at http://ftp.ebi.ac.uk/pub/databases/blueprint/blueprint_Epivar/protocols/README_chipseq_analysis_ebi_20160816
e
A0A0H2VIQ5
ebi.ac.uk
Updated Feb 1, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2020). A0A0H2VIQ5 [Dataset]. https://www.ebi.ac.uk/interpro/protein/A0A0H2VIQ5
Explore at:
Dataset updated
Feb 1, 2020
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Catalyzes the oxidation of glucose 6-phosphate to 6-phosphogluconolactone
i
M-CSA
integbio.jp
Updated Mar 31, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
European Bioinformatics Institute (2020). M-CSA [Dataset]. https://integbio.jp/dbcatalog/en/record/nbdc00032?jtpl=56
Explore at:
Dataset updated
Mar 31, 2020
Dataset provided by
European Bioinformatics Institute
License
http://www.ebi.ac.uk/about/terms-of-usehttp://www.ebi.ac.uk/about/terms-of-use
Description
M-CSA is a database about enzyme reaction. It provides annotation on the protein, catalytic residues, cofactors, and the reaction mechanisms of enzymes. Each record contains references about protein and structure (sequence, biological species, PDB, Catalytic CATH Domains), enzyme reaction and enzyme mechanisms. This database represents a unified resource that combines the data in both MACiE (http://www.ebi.ac.uk/thornton-srv/databases/MACiE/) and the CSA (http://www.ebi.ac.uk/thornton-srv/databases/CSA/).
e
PTS_IIA_man
ebi.ac.uk
Updated Jun 14, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). PTS_IIA_man [Dataset]. https://www.ebi.ac.uk/interpro/entry/cdd/
Explore at:
Dataset updated
Jun 14, 2016
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Data item of the type domain from the database cdd with accession cd00006 and name PTS_IIA_man
Curated GWAS summary statistics on East Asian ancestry on 19 blood count...
zenodo.org
data.niaid.nih.gov
bin
Updated Jun 25, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lucie Troubat; Hanna Julienne; Hanna Julienne; Lucie Troubat (2023). Curated GWAS summary statistics on East Asian ancestry on 19 blood count traits and glycemic traits [Dataset]. http://doi.org/10.5281/zenodo.8068881
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.8068881
Dataset updated
Jun 25, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Lucie Troubat; Hanna Julienne; Hanna Julienne; Lucie Troubat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Genome wide curated summary statistics on 19 blood count traits and glycemic traits

File format is the inittable format intended to be used with the Joint Analysis of Summary Statistics (JASS), which allows to perform multi-trait GWAS:

https://gitlab.pasteur.fr/statistical-genetics/jass

GWAS of hematological traits originate from Chen et al paper and were downloaded from the GWAS Catalog (https://www.ebi.ac.uk/gwas/publications/32888493#study_panel). GWAS of glycemic traits come from the (18) study downloadable from GWAS Catalog (https://www.ebi.ac.uk/gwas/publications/34059833).

Full description of the method used to derive this dataset can be found in
z
Aligned Long-Read Murine Samples: Part 14
zenodo.org
bin, html, zip
Updated Aug 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Theodore Nelson; Theodore Nelson (2024). Aligned Long-Read Murine Samples: Part 14 [Dataset]. http://doi.org/10.5281/zenodo.13336923
Explore at:
zip, bin, htmlAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.13336923
Dataset updated
Aug 18, 2024
Dataset provided by
Columbia University
Authors
Theodore Nelson; Theodore Nelson
Description
Accession numbers can be queried via the European Nucleotide Archive for metadata information: https://www.ebi.ac.uk/ena

Samples analyzed via the original version of L-RAPiT:

Nelson, T.M.; Ghosh, S.; Postler, T.S. L-RAPiT: A Cloud-Based Computing Pipeline for the Analysis of Long-Read RNA Sequencing Data. Int. J. Mol. Sci. 2022, 23, 15851. https://doi.org/10.3390/ijms232415851
n
Data from: CluSTr
neuinfo.org
dknet.org
+1more
Updated Sep 7, 2012
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2012). CluSTr [Dataset]. http://identifiers.org/RRID:SCR_007600
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_007600 https://identifiers.org/RRID:SCR_007600/resolver?q=&i=rrid
Dataset updated
Sep 7, 2012
Description
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone., documented June 24, 2013 as per the Miriam database (http://www.ebi.ac.uk/miriam/main/collections/MIR:00000021). The CluSTr database offers an automatic classification of UniProt Knowledgebase and IPI proteins into groups of related proteins. The clustering is based on analysis of all pairwise comparisons between protein sequences. The database provides links to InterPro, which integrates information on protein families, domains and functional sites from PROSITE, PRINTS, Pfam, ProDom, SMART, TIGRFAMs, Gene3D, SUPERFAMILY, PIR Superfamily and PANTHER. To date (2011), CluSTr contains the following information: * 9,450,285 sequences from UniProt Knowledgebase release 15.6 * 308,281 sequences from IPI * 3,636,831,744 similarities, with pairwise alignments generated on-the-fly * 17,616,060 clusters * Clustering for 972 organisms with completely sequenced genomes. For the full list of the genomes see Integr8 * Putative homologues predictions for the above species. For more information see Homologue Selection at Integr8
e
SUPERFAMILY
ebi.ac.uk
Updated Apr 30, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2020). SUPERFAMILY [Dataset]. https://www.ebi.ac.uk/interpro/entry/ssf/
Explore at:
Dataset updated
Apr 30, 2020
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset of the type entry from the database SUPERFAMILY - version 1.75
e
RNA-Seq count data from fresh oral an oropharyngeal tissue samples from head...
ebi.ac.uk
Updated Apr 21, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Katarina Mandić; Ivan Sabol (2024). RNA-Seq count data from fresh oral an oropharyngeal tissue samples from head and neck squamous cell carcinoma patients and fresh tonsil tissue samples from control patients [Dataset]. https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-MTAB-13725
Explore at:
Dataset updated
Apr 21, 2024
Authors
Katarina Mandić; Ivan Sabol
Description
Integration of transcriptome with miRNome and methylome data aiming to elucidate role of epigenetics on gene expression in head and neck squamous cell carcinoma.
e
RNA sequencing of conventional and reset human pluripotent stem cells
ebi.ac.uk
omicsdi.org
Updated Sep 9, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paul Bertone (2014). RNA sequencing of conventional and reset human pluripotent stem cells [Dataset]. https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-2857/
Explore at:
Dataset updated
Sep 9, 2014
Authors
Paul Bertone
Description
Human pluripotent cells were reset to ground state pluripotency by transient overexpression of NANOG and KLF2 and subsequent inhibition of ERK and protein kinase C. Transcriptional profiling of reset cells and conventional pluripotent stem cell cultures was carried out by RNA-seq, in tandem with mouse embryonic stem cells propagated under similar conditions to assess the combinatorial effects of MEK inhibitor PD0325901, GSK3 inhibitor CHIR99021 and PKC inhibitor Go6983.
e
Data from: Neutrophil-derived migrasomes are an essential part of the...
ebi.ac.uk
Updated Jul 5, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dong Jiang (2024). Neutrophil-derived migrasomes are an essential part of the coagulation system [Dataset]. https://www.ebi.ac.uk/pride/archive/projects/PXD051238
Explore at:
Dataset updated
Jul 5, 2024
Authors
Dong Jiang
Variables measured
Proteomics
Description
Neutrophil-derived migrasomes are an essential part of the coagulation system, Mouse lipidomics
e
PRINTS
ebi.ac.uk
Updated Apr 8, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2013). PRINTS [Dataset]. https://www.ebi.ac.uk/interpro/entry/prints/
Explore at:
Dataset updated
Apr 8, 2013
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset of the type entry from the database PRINTS - version 42.0
e
Protein of unknown function DUF6083
ebi.ac.uk
Updated Nov 9, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Protein of unknown function DUF6083 [Dataset]. https://www.ebi.ac.uk/interpro/entry/InterPro/?search=DUF608
Explore at:
Dataset updated
Nov 9, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Data item of the type family from the database interpro with accession IPR045729 and name Protein of unknown function DUF6083
e
Data from: Evolution of enhanced innate immune evasion by the SARS-CoV-2...
ebi.ac.uk
Updated Jan 20, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mehdi Bouhaddou (2022). Evolution of enhanced innate immune evasion by the SARS-CoV-2 B.1.1.7 UK variant [Dataset]. https://www.ebi.ac.uk/pride/archive/projects/PXD026302
Explore at:
Dataset updated
Jan 20, 2022
Authors
Mehdi Bouhaddou
Area covered
United Kingdom
Variables measured
Proteomics
Description
Here we use unbiased abundance proteomics and phosphoproteomics to assess global changes to host and viral proteins in Calu-3 cells at 10 and 24 hours post infection with either the B.1.1.7 UK variant or early-lineage SARS-CoV-2 viruses VIC and IC19.
e
Domain of unknown function DUF6299
ebi.ac.uk
Updated Nov 8, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Domain of unknown function DUF6299 [Dataset]. https://www.ebi.ac.uk/interpro/entry/InterPro/?search=DUF629
Explore at:
Dataset updated
Nov 8, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Data item of the type domain from the database interpro with accession IPR046266 and name Domain of unknown function DUF6299

Facebook

Twitter

Click to copy link

Link copied

Cite

(2013). Q13316 [Dataset]. https://www.ebi.ac.uk/interpro/protein/reviewed/Q13316

Q13316

Explore at:

24 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Apr 17, 2013

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

May have a dual function during osteoblast differentiation. In the nucleus of undifferentiated osteoblasts, unphosphorylated form acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the osteoblast to osteocyte transition phase it is phosphorylated and exported into the extracellular matrix, where it regulates nucleation of hydroxyapatite

Clear search

Close search

Google apps

Main menu

Q13316

INSDC Sequences

Data from: Competitive binding of STATs to receptor phospho-Tyr motifs...

ChEMBL EBI Small Molecules Database

Context

Content

Acknowledgements

INSDC Host Organism Sequences

Blueprint: A human variation panel of genetic influences on epigenomes and...

A0A0H2VIQ5

M-CSA

PTS_IIA_man

Curated GWAS summary statistics on East Asian ancestry on 19 blood count...

Aligned Long-Read Murine Samples: Part 14

Data from: CluSTr

SUPERFAMILY

RNA-Seq count data from fresh oral an oropharyngeal tissue samples from head...

RNA sequencing of conventional and reset human pluripotent stem cells

Data from: Neutrophil-derived migrasomes are an essential part of the...

PRINTS

Protein of unknown function DUF6083

Data from: Evolution of enhanced innate immune evasion by the SARS-CoV-2...

Domain of unknown function DUF6299

Q13316