100+ datasets found
  1. f

    STRING Network Analysis

    • figshare.com
    Updated May 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dain Lee (2025). STRING Network Analysis [Dataset]. http://doi.org/10.6084/m9.figshare.29126396.v2
    Explore at:
    Dataset updated
    May 22, 2025
    Dataset provided by
    figshare
    Authors
    Dain Lee
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This file contains the protein-protein interaction analysis dataset that was used in the unpublished manuscript and was further analyzed with the STRING online software.Significantly upregulated mRNAs (2,777 genes; p < 0.05) identified by bulk RNA-seq were analyzed using the STRING module in Cytoscape v.2.2.0 (Institute for System Biology; WA; USA). A cluster network was constructed using the MCL algorithm with a granularity parameter of 4, followed by filtering nodes with mcl.cluster > 10. The resulting 1,848 nodes were processed through STRING v12.0 (Swiss Institute of Bioinformatics; Lausanne; Switzerland) to generate a protein–protein interaction (PPI) network, incorporating evidence from text mining, genomic neighborhood, experimental data, curated databases, co-expression, gene fusion, and co-occurrence, with a minimum confidence score threshold of 0.40. Network modules were defined using the DBSCAN clustering algorithm with an ε parameter of 2. Cluster 1, representing the largest gene set (101 genes), was further analyzed by sorting the top 20 nodes with the highest node degree, resulting in a network comprising 101 nodes and 756 edges. Global network metrics indicated an average node degree of 15, a local clustering coefficient of 0.600, and a PPI enrichment p-value of < 1 × 10⁻¹⁶. The average values of coexpression, experimentally determined interactions, automated text mining, and combined scores were calculated.

  2. f

    STRING protein-protein interaction networks for WT-C vs. WT-D.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Parisa Sooshtari; Biao Feng; Saumik Biswas; Michael Levy; Hanxin Lin; Zhaoliang Su; Subrata Chakrabarti (2023). STRING protein-protein interaction networks for WT-C vs. WT-D. [Dataset]. http://doi.org/10.1371/journal.pone.0270287.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 14, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Parisa Sooshtari; Biao Feng; Saumik Biswas; Michael Levy; Hanxin Lin; Zhaoliang Su; Subrata Chakrabarti
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    STRING protein-protein interaction networks for WT-C vs. WT-D.

  3. f

    Validation of the total new predicted links and the new predicted links...

    • plos.figshare.com
    xls
    Updated Jun 5, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wei Zhang; Jia Xu; Yuanyuan Li; Xiufen Zou (2023). Validation of the total new predicted links and the new predicted links associated with the 10 proteins by STRING database for the 14317_PPI data. [Dataset]. http://doi.org/10.1371/journal.pone.0177029.t009
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Wei Zhang; Jia Xu; Yuanyuan Li; Xiufen Zou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Validation of the total new predicted links and the new predicted links associated with the 10 proteins by STRING database for the 14317_PPI data.

  4. f

    Statistics of the genes in the protein interaction network constructed based...

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shunyao Wu; Fengjing Shao; Jun Ji; Rencheng Sun; Rizhuang Dong; Yuanke Zhou; Shaojie Xu; Yi Sui; Jianlong Hu (2023). Statistics of the genes in the protein interaction network constructed based on the STRING database. [Dataset]. http://doi.org/10.1371/journal.pone.0116505.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Shunyao Wu; Fengjing Shao; Jun Ji; Rencheng Sun; Rizhuang Dong; Yuanke Zhou; Shaojie Xu; Yi Sui; Jianlong Hu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Statistics of the genes in the protein interaction network constructed based on the STRING database.

  5. f

    Unknown genes and genes without any interactions in STRING in predicted T....

    • datasetcatalog.nlm.nih.gov
    Updated Sep 28, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jäntti, Jussi; Castillo, Sandra; Pakula, Tiina; Oja, Merja; Penttilä, Merja; Arvas, Mikko; Kludas, Jana; Rousu, Juho; Brouard, Céline (2016). Unknown genes and genes without any interactions in STRING in predicted T. reesei secretion network. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001517561
    Explore at:
    Dataset updated
    Sep 28, 2016
    Authors
    Jäntti, Jussi; Castillo, Sandra; Pakula, Tiina; Oja, Merja; Penttilä, Merja; Arvas, Mikko; Kludas, Jana; Rousu, Juho; Brouard, Céline
    Description

    Column ‘Gene’ contains the T. reesei gene ID. ‘In STRING’ tells if the gene has interactions in STRING. Columns ‘Btw’ and ‘Deg’ denote the betweenness and degree network statistics of the corresponding gene. Columns ‘Class’ and ‘Putative secretion pathway component’ are author assigned classifications. ‘Taxon specificity’ gives the largest taxonomic group the gene was found in.

  6. f

    Basic information of the four original networks (HIPPIE, HumanNet, FunCoup...

    • datasetcatalog.nlm.nih.gov
    Updated Dec 22, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang, Jian; Lin, Limei; Yang, Fan; Wu, Duzhi; Yang, Tinghong; Zhao, Jing (2017). Basic information of the four original networks (HIPPIE, HumanNet, FunCoup and STRING) and the GO network. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001798563
    Explore at:
    Dataset updated
    Dec 22, 2017
    Authors
    Yang, Jian; Lin, Limei; Yang, Fan; Wu, Duzhi; Yang, Tinghong; Zhao, Jing
    Description

    Basic information of the four original networks (HIPPIE, HumanNet, FunCoup and STRING) and the GO network.

  7. Supplemental Table S5

    • figshare.com
    docx
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abraham Moller (2023). Supplemental Table S5 [Dataset]. http://doi.org/10.6084/m9.figshare.13355945.v1
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Abraham Moller
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Summary statistics for protein-protein interaction networks identified with STRING amongst genes corresponding to significant SNPs or k-mers (inside or adjacent to genes). PPI enrichment p-value corresponds to the likelihood nodes and edges would be selected from the S. aureus database by chance.

  8. Citation network of the knowledge co-production literature. Supplementary...

    • zenodo.org
    • data.niaid.nih.gov
    csv
    Updated Dec 8, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Justyna Bandola-Gill; Justyna Bandola-Gill; Megan Arthur; Megan Arthur; Rhodri Ivor Leng; Rhodri Ivor Leng (2021). Citation network of the knowledge co-production literature. Supplementary data. [Dataset]. http://doi.org/10.5281/zenodo.5762451
    Explore at:
    csvAvailable download formats
    Dataset updated
    Dec 8, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Justyna Bandola-Gill; Justyna Bandola-Gill; Megan Arthur; Megan Arthur; Rhodri Ivor Leng; Rhodri Ivor Leng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data description

    This data note describes the final citation network dataset analysed in the manuscript "What is co-production? Conceptualising and understanding co-production of knowledge and policy across different theoretical perspectives’"[1].

    The data collection strategy used to construct the following dataset can be found in the associated manuscript [1]. These data were originally downloaded from the Web of Science (WoS) Core Collection via the library subscription of the University of Edinburgh via a systematic search methodology that sought to capture literature relevant to ‘knowledge co-production’. The dataset consists of 1,893 unique document reference strings (nodes) interlinked together by 9,759 citation links (edges). The network dataset describes a directed citation network composed of papers relevant to 'knowledge co-production', and is split into two files: (i) ‘KnowCo_node_attribute_list.csv’ contains attributes of the 1,893 documents (nodes); and (ii) ‘KnowCo_edge_list.csv’ records the citation links (edges) between pairs of documents.

    1. ‘KnowCo_node_attribute_list.csv’ consists of attributes of the 1,893 nodes (documents) of the citation network. Due to the approach used to collect data, there are two types of node: (i) 525 nodes represent documents retrieved from WoS via the systematic search strategy, and these have full attribute data including their reference lists; and (ii) 1,368 documents that were cited >2 times by our 525 fully retrieved papers (see manuscript for full description [1]). The columns refer to:

    Id, the unique identifier. Fully retrieved documents are identified via a unique identifier that begins with ‘f’ followed by an integer (e.g. f1, f2, etc.). Non-retrieved documents are identified via a unique identifier beginning with ‘n’ followed by an integer (e.g. n1, n2, etc.).

    Label, contains the unique reference string of the document for which the attribute data in that row corresponds. Reference strings contain the last name of the first author, publication year, journal, volume, start page, and DOI (if available).

    authors, all author names. These are in the order that these names appear in the authorship list of the corresponding document. These data are only available for fully retrieved documents.

    title, document title. These data are only available for fully retrieved documents.

    journal, journal of publication. These data are only available for fully retrieved documents. For those interested in journal data for the remaining papers, this can be extracted from the reference string in the ‘Label’ column.

    year, year of publication. These data are available for all nodes.

    type, document type (e.g. article, review). Available only for fully retrieved documents.

    wos_total_citations, total citation count as recorded by Web of Science Core Collection as of May 2020. Available only for fully retrieved documents.

    wos_id, Web of Science accession number. Available only for fully retrieved documents only, for non-retrieved documents ‘CitedReference’ fills the cell.

    cluster, provides the cluster membership number as discussed within the manuscript, established via modularity maximisation via the Leiden algorithm (Res 0.8; Q=0.53|5 clusters). Available for all nodes.

    indegree, total count of within network citations to a given document. Due to the composition of the network, this figure tells us the total number of citations from 525 fully retrieved documents to each of the 1,893 documents within the network. Available for all nodes.

    outdegree, total count of within network references from a given document. Due to the composition of the network, only fully retrieved documents can have a value >0 because only these documents have their associated reference list data. Available for all nodes.

    2. ‘KnowCo_edge _list.csv’ is an edge list containing 9,759 citation links between the 1,893 documents. The columns refer to:

    Source, the citing document’s unique identifier.

    Target, the cited document’s unique identifier.

    Notes

    [1] Bandola-Gill, J., Arthur, M., & Leng, R. I. (Under review). What is co-production? Conceptualising and understanding co-production of knowledge and policy across different theoretical perspectives. Evidence & Policy

  9. w

    CreativeWork

    • pfocr.wikipathways.org
    Updated Jun 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WikiPathways (2023). CreativeWork [Dataset]. https://pfocr.wikipathways.org/figures/PMC10242111_fcell-11-1165308-g004.html
    Explore at:
    Dataset updated
    Jun 9, 2023
    Dataset authored and provided by
    WikiPathways
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Protein–protein interaction network of the top differentially expressed genes between the patient’s samples and the Ctrl cohort. Edges represent protein–protein associations. Confidence ≥0.700; maximum number of interactors ≤20. Edge confidence: high (0.700) and highest (0.900) (see https://string-db.org/cgi/network).

  10. Results on scGEM with ground truth determined by (1) cell-type specific...

    • plos.figshare.com
    xls
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wenjun Zhao; Erica Larschan; Björn Sandstede; Ritambhara Singh (2025). Results on scGEM with ground truth determined by (1) cell-type specific networks validated by experimental data [44] and (2) STRING [45]. The random baseline for AUPRC is 0.020 for the experimental network and 0.058 for the STRING network. [Dataset]. http://doi.org/10.1371/journal.pcbi.1012476.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 28, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Wenjun Zhao; Erica Larschan; Björn Sandstede; Ritambhara Singh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Results on scGEM with ground truth determined by (1) cell-type specific networks validated by experimental data [44] and (2) STRING [45]. The random baseline for AUPRC is 0.020 for the experimental network and 0.058 for the STRING network.

  11. Selection of 30 central genes from PPI network, including 17 upregulated and...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rong Fan; Lijin Dong; Ping Li; Xiaoming Wang; Xuewei Chen (2023). Selection of 30 central genes from PPI network, including 17 upregulated and 13 downregulated genes, by using the STRING and Cytoscape software. [Dataset]. http://doi.org/10.1371/journal.pone.0251962.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Rong Fan; Lijin Dong; Ping Li; Xiaoming Wang; Xuewei Chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Selection of 30 central genes from PPI network, including 17 upregulated and 13 downregulated genes, by using the STRING and Cytoscape software.

  12. d

    Data from: Determining the minimum number of protein-protein interactions...

    • dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated Apr 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Natsu Nakajima; Morihiro Hayashida; Jesper Jansson; Osamu Maruyama; Tatsuya Akutsu (2025). Determining the minimum number of protein-protein interactions required to support known protein complexes [Dataset]. http://doi.org/10.5061/dryad.8s3682g
    Explore at:
    Dataset updated
    Apr 16, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Natsu Nakajima; Morihiro Hayashida; Jesper Jansson; Osamu Maruyama; Tatsuya Akutsu
    Time period covered
    Apr 30, 2018
    Description

    The prediction of protein complexes from protein-protein interactions (PPIs) is a well-studied problem in bioinformatics. However, the currently available PPI data is not enough to describe all known protein complexes. In this paper, we express the problem of determining the minimum number of (additional) required protein-protein interactions as a graph theoretic problem under the constraint that each complex constitutes a connected component in a PPI network. For this problem, we develop two computational methods: one is based on integer linear programming (ILPMinPPI) and the other one is based on an existing greedy-type approximation algorithm (GreedyMinPPI) originally developed in the context of communication and social networks. Since the former method is only applicable to datasets of small size, we apply the latter method to a combination of the CYC2008 protein complex dataset and each of eight PPI datasets (STRING, MINT, BioGRID, IntAct, DIP, BIND, WI-PHI, iRefIndex). The results...

  13. w

    Data from: VERY HIGH-SPEED DRILL STRING COMMUNICATIONS NETWORK

    • data.wu.ac.at
    Updated Sep 29, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). VERY HIGH-SPEED DRILL STRING COMMUNICATIONS NETWORK [Dataset]. https://data.wu.ac.at/schema/edx_netl_doe_gov/NDNjNzUzN2MtZGNjOS00NDFmLTliNmUtZWJhN2E0M2I3MDZh
    Explore at:
    Dataset updated
    Sep 29, 2016
    Description

    A history and project summary of the development of a very high-speed drill string communications network are given. The summary includes laboratory and field test results, including recent successes of the system in wells in Oklahoma. A brief explanation of commercialization plans is included. The primary conclusion for this work is that a high data rate communications system can be made sufficiently robust, reliable, and transparent to the end user to be successfully deployed in a down-hole drilling environment. A secondary conclusion is that a networking system with user data bandwidth of at least 1 million bits per second can be built to service any practical depth of well using multiple repeaters (Links), with spacing between the Links of at least 1000 ft.

  14. f

    Summary of GO term-centric results obtained by different network embedding...

    • plos.figshare.com
    xls
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cen Wan; Domenico Cozzetto; Rui Fa; David T. Jones (2023). Summary of GO term-centric results obtained by different network embedding representations and corresponding functional representations based on Combinedscore, Textmining, Experimental, Database and Coexpression networks working with different classification algorithms during hold-out evaluation. [Dataset]. http://doi.org/10.1371/journal.pone.0209958.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Cen Wan; Domenico Cozzetto; Rui Fa; David T. Jones
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Summary of GO term-centric results obtained by different network embedding representations and corresponding functional representations based on Combinedscore, Textmining, Experimental, Database and Coexpression networks working with different classification algorithms during hold-out evaluation.

  15. d

    ProtChemSI

    • dknet.org
    • scicrunch.org
    • +2more
    Updated Sep 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). ProtChemSI [Dataset]. http://identifiers.org/RRID:SCR_006115
    Explore at:
    Dataset updated
    Sep 1, 2024
    Description

    The database of protein-chemical structural interactions includes all existing 3D structures of complexes of proteins with low molecular weight ligands. When one considers the proteins and chemical vertices of a graph, all these interactions form a network. Biological networks are powerful tools for predicting undocumented relationships between molecules. The underlying principle is that existing interactions between molecules can be used to predict new interactions. For pairs of proteins sharing a common ligand, we use protein and chemical superimpositions combined with fast structural compatibility screens to predict whether additional compounds bound by one protein would bind the other. The current version includes data from the Protein Data Bank as of August 2011. The database is updated monthly.

  16. e

    Data from: Plasma proteomics in epilepsy: network-based identification of...

    • ebi.ac.uk
    Updated Nov 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Liisa Arike (2024). Plasma proteomics in epilepsy: network-based identification of proteins associated with seizures [Dataset]. https://www.ebi.ac.uk/pride/archive/projects/PXD057292
    Explore at:
    Dataset updated
    Nov 19, 2024
    Authors
    Liisa Arike
    Variables measured
    Proteomics
    Description

    Purpose Identification of potential biomarkers of seizures. Methods In this exploratory study, we quantified plasma protein intensities in 15 patients with recent seizures compared to 15 patients with long-standing seizure freedom. Using TMT-based proteomics we found fifty-one differentially expressed proteins. Results Network analyses including co-expression networks and protein-protein interaction networks, using the STRING database, followed by network centrality and modularity analyses revealed 22 protein modules, with one module showing a significant association with seizures. The protein-protein interaction network centered around this module identified a subnetwork of 125 proteins, grouped into four clusters. Notably, one cluster (mainly enriching inflammatory pathways and Gene Ontology terms) demonstrated the highest enrichment of known epilepsy-related genes. Conclusion Overall, our network-based approach identified a protein module linked with seizures. The module contained known markers of epilepsy and inflammation. The results also demonstrate the potential of network analysis in discovering new biomarkers for improved epilepsy management.

  17. Z

    Evaluating homophily of human PPI with respect to chromosomes

    • data.niaid.nih.gov
    Updated Jul 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Blankenberg, Daniel (2022). Evaluating homophily of human PPI with respect to chromosomes [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6941314
    Explore at:
    Dataset updated
    Jul 30, 2022
    Dataset provided by
    Cumbo, Fabio
    Blankenberg, Daniel
    Apollonio, Nicola
    Franciosa, Paolo Giulio
    Santoni, Daniele
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Homophily/heterophily evaluation, expressed in terms of z-score values, is related to the human Protein-Protein Interaction Network (PPI), obtained from the STRING v11.5 database (https://string-db.org) setting standard threshold on edge score (T=700). Each protein occurring in the PPI was assigned to a class corresponding to the chromosome the related gene belongs to.

    A total of 23 classes (chr1, chr2, ..., chr22, chrX) were considered (excluding the class corresponding to chromosome Y because of the small number of genes occurring in the network).

    The homophily/heterophily nature of the network, with respect to chromosome classes, was evaluated through HONTO tool (https://github.com/cumbof/honto).

    In other words, the tendency of proteins to preferentially interact with proteins whose genes are physically located on the same chromosome (homophily) or on different chromosomes (heterophily) was investigated and evaluated in terms of z-scores.

    Values related to intra (along the diagonal) and inter chromosomal interactions (other than the diagonal) are also reported as a heatmap.

    As one can observe, values occurring in the diagonal are clearly higher than values out of the diagonal, leading to assess a homophilic nature of the network, confirming the link between shared chromosome and interaction in the PPI.

  18. Protein interaction data for 222 BM zone components

    • figshare.com
    xlsx
    Updated Feb 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mychel Morais; Ranjay Jayadev; Rachel Lennon; David Sherwood; Jamie Ellingford; Craig Lawless (2022). Protein interaction data for 222 BM zone components [Dataset]. http://doi.org/10.6084/m9.figshare.19127504.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 6, 2022
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Mychel Morais; Ranjay Jayadev; Rachel Lennon; David Sherwood; Jamie Ellingford; Craig Lawless
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    All human protein interactions were obtained from STRING (https://string-db.org/, version 11.0). Interactions were then filtered to those involving only BM zone proteins. Related to Fig. S6B.

  19. Y

    Citation Network Graph

    • shibatadb.com
    Updated Apr 8, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yubetsu (2015). Citation Network Graph [Dataset]. https://www.shibatadb.com/article/xMJVKyFF
    Explore at:
    Dataset updated
    Apr 8, 2015
    Dataset authored and provided by
    Yubetsu
    License

    https://www.shibatadb.com/license/data/proprietary/v1.0/license.txthttps://www.shibatadb.com/license/data/proprietary/v1.0/license.txt

    Description

    Network of 45 papers and 108 citation links related to "Russian doll spectrum in a non-Abelian string-net ladder".

  20. f

    Table1_Predicting Human Protein Subcellular Locations by Using a Combination...

    • frontiersin.figshare.com
    xlsx
    Updated Jun 3, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lei Chen; ZhanDong Li; Tao Zeng; Yu-Hang Zhang; ShiQi Zhang; Tao Huang; Yu-Dong Cai (2023). Table1_Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features.XLSX [Dataset]. http://doi.org/10.3389/fgene.2021.783128.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Frontiers
    Authors
    Lei Chen; ZhanDong Li; Tao Zeng; Yu-Hang Zhang; ShiQi Zhang; Tao Huang; Yu-Dong Cai
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Given the limitation of technologies, the subcellular localizations of proteins are difficult to identify. Predicting the subcellular localization and the intercellular distribution patterns of proteins in accordance with their specific biological roles, including validated functions, relationships with other proteins, and even their specific sequence characteristics, is necessary. The computational prediction of protein subcellular localizations can be performed on the basis of the sequence and the functional characteristics. In this study, the protein–protein interaction network, functional annotation of proteins and a group of direct proteins with known subcellular localization were used to construct models. To build efficient models, several powerful machine learning algorithms, including two feature selection methods, four classification algorithms, were employed. Some key proteins and functional terms were discovered, which may provide important contributions for determining protein subcellular locations. Furthermore, some quantitative rules were established to identify the potential subcellular localizations of proteins. As the first prediction model that uses direct protein annotation information (i.e., functional features) and STRING-based protein–protein interaction network (i.e., network features), our computational model can help promote the development of predictive technologies on subcellular localizations and provide a new approach for exploring the protein subcellular localization patterns and their potential biological importance.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Dain Lee (2025). STRING Network Analysis [Dataset]. http://doi.org/10.6084/m9.figshare.29126396.v2

STRING Network Analysis

Explore at:
Dataset updated
May 22, 2025
Dataset provided by
figshare
Authors
Dain Lee
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This file contains the protein-protein interaction analysis dataset that was used in the unpublished manuscript and was further analyzed with the STRING online software.Significantly upregulated mRNAs (2,777 genes; p < 0.05) identified by bulk RNA-seq were analyzed using the STRING module in Cytoscape v.2.2.0 (Institute for System Biology; WA; USA). A cluster network was constructed using the MCL algorithm with a granularity parameter of 4, followed by filtering nodes with mcl.cluster > 10. The resulting 1,848 nodes were processed through STRING v12.0 (Swiss Institute of Bioinformatics; Lausanne; Switzerland) to generate a protein–protein interaction (PPI) network, incorporating evidence from text mining, genomic neighborhood, experimental data, curated databases, co-expression, gene fusion, and co-occurrence, with a minimum confidence score threshold of 0.40. Network modules were defined using the DBSCAN clustering algorithm with an ε parameter of 2. Cluster 1, representing the largest gene set (101 genes), was further analyzed by sorting the top 20 nodes with the highest node degree, resulting in a network comprising 101 nodes and 756 edges. Global network metrics indicated an average node degree of 15, a local clustering coefficient of 0.600, and a PPI enrichment p-value of < 1 × 10⁻¹⁶. The average values of coexpression, experimentally determined interactions, automated text mining, and combined scores were calculated.

Search
Clear search
Close search
Google apps
Main menu