100+ datasets found
  1. Extracted Schemas from the Life Sciences Linked Open Data Cloud

    • figshare.com
    txt
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maulik Kamdar (2023). Extracted Schemas from the Life Sciences Linked Open Data Cloud [Dataset]. http://doi.org/10.6084/m9.figshare.12402425.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Maulik Kamdar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is related to the manuscript "An empirical meta-analysis of the life sciences linked open data on the web" published at Nature Scientific Data. If you use the dataset, please cite the manuscript as follows:Kamdar, M.R., Musen, M.A. An empirical meta-analysis of the life sciences linked open data on the web. Sci Data 8, 24 (2021). https://doi.org/10.1038/s41597-021-00797-yWe have extracted schemas from more than 80 publicly available biomedical linked data graphs in the Life Sciences Linked Open Data (LSLOD) cloud into an LSLOD schema graph and conduct an empirical meta-analysis to evaluate the extent of semantic heterogeneity across the LSLOD cloud. The dataset published here contains the following files:- The set of Linked Data Graphs from the LSLOD cloud from which schemas are extracted.- Refined Sets of extracted classes, object properties, data properties, and datatypes, shared across the Linked Data Graphs on LSLOD cloud. Where the schema element is reused from a Linked Open Vocabulary or an ontology, it is explicitly indicated.- The LSLOD Schema Graph, which contains all the above extracted schema elements interlinked with each other based on the underlying content. Sample instances and sample assertions are also provided along with broad level characteristics of the modeled content. The LSLOD Schema Graph is saved as a JSON Pickle File. To read the JSON object in this Pickle file use the Python command as follows:with open('LSLOD-Schema-Graph.json.pickle' , 'rb') as infile: x = pickle.load(infile, encoding='iso-8859-1')Check the Referenced Link for more details on this research, raw data files, and code references.

  2. d

    Long-Term ST Database (Split)

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qiu, Dicong (2023). Long-Term ST Database (Split) [Dataset]. http://doi.org/10.7910/DVN/HTQY5M
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Qiu, Dicong
    Description

    Long-Term ST Database carefully split into training and testing datasets.

  3. d

    Database

    • search.dataone.org
    Updated Nov 8, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Akter, Shahinur (2023). Database [Dataset]. http://doi.org/10.7910/DVN/R31F6M
    Explore at:
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Akter, Shahinur
    Description
  4. バイオサイエンスにおけるID

    • figshare.com
    pdf
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Toshiaki Katayama (2023). バイオサイエンスにおけるID [Dataset]. http://doi.org/10.6084/m9.figshare.6597509.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Toshiaki Katayama
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Presentation slides on the identifiers in biosciences at the Japan Open Science Summit (JOSS) 2018.

  5. BioSharing: examples of use in NIH BD2K CEDAR, BioCADDIE and ELIXIR

    • figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alejandra Gonzalez-Beltran; Peter McQuilton; Allyson L. Lister; Milo Thurston; Susanna-Assunta Sansone; Philippe Rocca-Serra (2023). BioSharing: examples of use in NIH BD2K CEDAR, BioCADDIE and ELIXIR [Dataset]. http://doi.org/10.6084/m9.figshare.1599797.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Alejandra Gonzalez-Beltran; Peter McQuilton; Allyson L. Lister; Milo Thurston; Susanna-Assunta Sansone; Philippe Rocca-Serra
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BioSharing (http://www.biosharing.org) is a curated, web-based, searchable portal of over 1,300 records describing content standards, databases and data policies in the life sciences, broadly encompassing the biological, natural and biomedical sciences. Among many features, the records can be searched and filtered, or grouped via the ‘Collection’ feature according to field of interest. An example is the Collection curated with the NIH BD2K bioCADDIE project, for various purposes. First, to select and track content standards that have been reviewed during the creation of the metadata model underpinning the Data Discovery Index. Second, as the work progresses and the prototype Index harvests dataset descriptors from different databases, the Collection will be extended to include the descriptions of these databases, including which (if any) standards they implement. This is key to support one of the bioCADDIE project use cases: to allow the searching and filtering of datasets that are compliant to a given community standard. Despite a growing set of standard guidelines and formats for describing their experiments, the barriers to authoring the experimental metadata necessary for sharing and interpreting datasets are tremendously high. Understanding how to comply with these standards takes time and effort and researchers view this as a burden that may benefit other scientists, but not themselves. To tackle this, with and for the NIH BD2K CEDAR project, we will explore methods to serve machine-readable versions of these standards that can inform the creation of metadata templates, rendering standards invisible to the researchers and driving them to strive for easier authoring of the experimental metadata. Lastly, as part of the ELIXIR-UK Node BioSharing is being developed to be the ELIXIR Standards Registry and will be progressively cross-linked to other registries, such as the ELIXIR Tools and Services Registry and the ELIXIR Training e-Support System (TeSS).

  6. i

    Life Sciences B2B Leads Database

    • introlynk.com
    Updated Dec 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IntroLynk (2025). Life Sciences B2B Leads Database [Dataset]. https://www.introlynk.com/b2b/life-sciences-leads/
    Explore at:
    Dataset updated
    Dec 8, 2025
    Dataset provided by
    IntroLynk
    Description

    Premium B2B leads from life sciences companies actively seeking business solutions. Access qualified prospects from biotechnology firms, research laboratories, genomics companies, and diagnostic organizations.

  7. H

    Databases for Formative Assessment of Knowledge, Attitudes, and Preferred...

    • dataverse.harvard.edu
    Updated Apr 30, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Catherine Todd (2019). Databases for Formative Assessment of Knowledge, Attitudes, and Preferred Media for Reproductive Health Engagement among Selected Groups of Youth and Men in Afghanistan [Dataset]. http://doi.org/10.7910/DVN/QULMNN
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 30, 2019
    Dataset provided by
    Harvard Dataverse
    Authors
    Catherine Todd
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Afghanistan
    Description

    Databases for three populations studied in above formative research in Stata 13. Final report document also attached with questionnaires as annex. Coding corresponds to question number.

  8. C

    Bioinformatics for Researchers in Life Sciences: Tools and Learning...

    • data.iadb.org
    csv, pdf
    Updated Apr 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IDB Datasets (2025). Bioinformatics for Researchers in Life Sciences: Tools and Learning Resources [Dataset]. http://doi.org/10.60966/kwvb-wr19
    Explore at:
    csv(276253), pdf(2989058), csv(355108)Available download formats
    Dataset updated
    Apr 10, 2025
    Dataset provided by
    IDB Datasets
    License

    Attribution-NonCommercial-NoDerivs 3.0 (CC BY-NC-ND 3.0)https://creativecommons.org/licenses/by-nc-nd/3.0/
    License information was derived automatically

    Time period covered
    Jan 1, 2020 - Jan 1, 2021
    Description

    The COVID-19 pandemic has shown that bioinformatics--a multidisciplinary field that combines biological knowledge with computer programming concerned with the acquisition, storage, analysis, and dissemination of biological data--has a fundamental role in scientific research strategies in all disciplines involved in fighting the virus and its variants. It aids in sequencing and annotating genomes and their observed mutations; analyzing gene and protein expression; simulation and modeling of DNA, RNA, proteins and biomolecular interactions; and mining of biological literature, among many other critical areas of research. Studies suggest that bioinformatics skills in the Latin American and Caribbean region are relatively incipient, and thus its scientific systems cannot take full advantage of the increasing availability of bioinformatic tools and data. This dataset is a catalog of bioinformatics software for researchers and professionals working in life sciences. It includes more than 300 different tools for varied uses, such as data analysis, visualization, repositories and databases, data storage services, scientific communication, marketplace and collaboration, and lab resource management. Most tools are available as web-based or desktop applications, while others are programming libraries. It also includes 10 suggested entries for other third-party repositories that could be of use.

  9. Research misconduct in health and life sciences research: A systematic...

    • plos.figshare.com
    pdf
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rafaelly Stavale; Graziani Izidoro Ferreira; João Antônio Martins Galvão; Fábio Zicker; Maria Rita Carvalho Garbi Novaes; César Messias de Oliveira; Dirce Guilhem (2023). Research misconduct in health and life sciences research: A systematic review of retracted literature from Brazilian institutions [Dataset]. http://doi.org/10.1371/journal.pone.0214272
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Rafaelly Stavale; Graziani Izidoro Ferreira; João Antônio Martins Galvão; Fábio Zicker; Maria Rita Carvalho Garbi Novaes; César Messias de Oliveira; Dirce Guilhem
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundMeasures to ensure research integrity have been widely discussed due to the social, economic and scientific impact of research integrity. In the past few years, financial support for health research in emerging countries has steadily increased, resulting in a growing number of scientific publications. These achievements, however, have been accompanied by a rise in retracted publications followed by concerns about the quality and reliability of such publications.ObjectiveThis systematic review aimed to investigate the profile of medical and life sciences research retractions from authors affiliated with Brazilian academic institutions. The chronological trend between publication and retraction date, reasons for the retraction, citation of the article after the retraction, study design, and the number of retracted publications by author and affiliation were assessed. Additionally, the quality, availability and accessibility of data regarding retracted papers from the publishers are described.MethodsTwo independent reviewers searched for articles that had been retracted since 2004 via PubMed, Web of Science, Biblioteca Virtual em Saúde (BVS) and Google Scholar databases. Indexed keywords from Medical Subject Headings (MeSH) and Descritores em Ciências da Saúde (DeCS) in Portuguese, English or Spanish were used. Data were also collected from the Retraction Watch website (www.retractionwatch.com). This study was registered with the PROSPERO systematic review database (CRD42017071647).ResultsA final sample of 65 articles was retrieved from 55 different journals with reported impact factors ranging from 0 to 32.86, with a median value of 4.40 and a mean of 4.69. The types of documents found were erratum (1), retracted articles (3), retracted articles with a retraction notice (5), retraction notices with erratum (3), and retraction notices (45). The assessment of the Retraction Watch website added 8 articles that were not identified by the search strategy using the bibliographic databases. The retracted publications covered a wide range of study designs. Experimental studies (40) and literature reviews (15) accounted for 84.6% of the retracted articles. Within the field of health and life sciences, medical science was the field with the largest number of retractions (34), followed by biological sciences (17). Some articles were retracted for at least two distinct reasons (13). Among the retrieved articles, plagiarism was the main reason for retraction (60%). Missing data were found in 57% of the retraction notices, which was a limitation to this review. In addition, 63% of the articles were cited after their retraction.ConclusionPublications are not retracted solely for research misconduct but also for honest error. Nevertheless, considering authors affiliated with Brazilian institutions, this review concluded that most of the retracted health and life sciences publications were retracted due to research misconduct. Because the number of publications is the most valued indicator of scientific productivity for funding and career progression purposes, a systematic effort from the national research councils, funding agencies, universities and scientific journals is needed to avoid an escalating trend of research misconduct. More investigations are needed to comprehend the underlying factors of research misconduct and its increasing manifestation.

  10. U

    Replication data - Sampling sites, databases for SDMs and R scripts for MLPE...

    • dataverse.unimi.it
    application/gzip, txt
    Updated Jun 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francesco Ceresa; Mattia Brambilla; Mattia Brambilla; Laura Kvist; Severino Vitulano; Michele Pes; Laura Tomasi; Paolo Pedrini; Andreas Hilpold; Petra Kranebitter; Francesco Ceresa; Laura Kvist; Severino Vitulano; Michele Pes; Laura Tomasi; Paolo Pedrini; Andreas Hilpold; Petra Kranebitter (2025). Replication data - Sampling sites, databases for SDMs and R scripts for MLPE models - for"Landscape characteristics influence regional dispersal in a high-elevation specialist migratory bird, the water pipit Anthus spinoletta" [Dataset]. http://doi.org/10.13130/RD_UNIMI/VTSVZQ
    Explore at:
    txt(3389), application/gzip(290798), txt(1198), txt(2238), application/gzip(278965), txt(1045), txt(1685)Available download formats
    Dataset updated
    Jun 6, 2025
    Dataset provided by
    UNIMI Dataverse
    Authors
    Francesco Ceresa; Mattia Brambilla; Mattia Brambilla; Laura Kvist; Severino Vitulano; Michele Pes; Laura Tomasi; Paolo Pedrini; Andreas Hilpold; Petra Kranebitter; Francesco Ceresa; Laura Kvist; Severino Vitulano; Michele Pes; Laura Tomasi; Paolo Pedrini; Andreas Hilpold; Petra Kranebitter
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Location of sampling sites, databases (species-with-data files in RDS format) and scripts associated with the paper. Two R scripts for Maximum Likelihood Population Effects models are provided: 1. script_MLPE: R script used to fit maximum likelihood population effects models, considering the entire study area; 2. script_MLPE_sub_areas: R script used to fit maximum likelihood population effects models, considering two sub-areas separately. Three files with coordinates of sampled individuals are provided: 1. WP_coord_N: sampling sites coordinates for all individuals captured in the northern sub-area (EPSG:3035 - ETRS89 / LAEA Europe); 2. WP_coord_S: sampling sites coordinates for all individuals captured in the southern sub-area (EPSG:3035 - ETRS89 / LAEA Europe); 3. WP_coordinates: sampling sites coordinates for all individuals (EPSG:3035 - ETRS89 / LAEA Europe). Two files in RDS format (R software, package SDMtune) with testing and training locations, as well as background points, for building Species Distribution Models (EPSG:3035 - ETRS89 / LAEA Europe).

  11. b

    BioStudies database

    • bioregistry.io
    Updated Apr 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). BioStudies database [Dataset]. http://identifiers.org/re3data:r3d100012627
    Explore at:
    Dataset updated
    Apr 28, 2021
    Description

    The BioStudies database holds descriptions of biological studies, links to data from these studies in other databases at EMBL-EBI or outside, as well as data that do not fit in the structured archives at EMBL-EBI. The database can accept a wide range of types of studies described via a simple format. It also enables manuscript authors to submit supplementary information and link to it from the publication.

  12. H

    School Health Database

    • nde-dev.biothings.io
    • dataverse.harvard.edu
    • +1more
    Updated Jul 22, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National School Boards Association (NSBA) (2015). School Health Database [Dataset]. http://doi.org/10.7910/DVN/M0XH5R
    Explore at:
    Dataset updated
    Jul 22, 2015
    Dataset authored and provided by
    National School Boards Association (NSBA)
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Users can search for sample policies, practices, and articles addressing health issues affecting schools and students. Topics include: asthma, school health programs, food safety, STIs, healthy eating, physical activity, sexual orientation, and teen pregnancy, among others. Background The School Health Database is maintained by the National School Boards Association (NSBA) and is supported by the Robert Woods Johnson Foundation and the Centers for Disease Control and Prevention (CDC). The School Health Database provides abstracts for policies and practices addressing health issues affecting schools and students. This database is useful for school policymakers. Topics include: asthma; communities of color; coordinated school health programs; food sa fety/food allergies; sexually transmitted infections; healthy eating; parent, family and community environment; physical activity; sexual orientation; gender identity; sun safety; teen pregnancy; t obacco use; and wellness. User Functionality Users can search approximately 2,000 abstracts. Users can search the database by: keyword, year, and target audience. Users can request more information or free materials by completing a Request Form on the website. Data Notes This database includes nearly 2,000 abstracts regarding programs and policies affecting the school health programs across the United States.

  13. d

    NBDC - National Bioscience Database Center

    • dknet.org
    • rrid.site
    • +1more
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). NBDC - National Bioscience Database Center [Dataset]. http://identifiers.org/RRID:SCR_000814
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    The National Bioscience Database Center (NBDC) intends to integrate all databases for life sciences in Japan, by linking each database with expediency to maximize convenience and make the entire system more user-friendly. We aim to focus our attention on the needs of the users of these databases who have all too often been neglected in the past, rather than the needs of the people tasked with the creation of databases. It is important to note that we will continue to honor the independent integrity of each database that will contribute to our endeavor, as we are fully aware that each database was originally crafted for specific purposes and divergent goals. Services: * Database Catalog - A catalog of life science related databases constructed in Japan that are also available in English. Information such as URL, status of the database site (active vs. inactive), database provider, type of data and subjects of the study are contained for each database record. * Life Science Database Cross Search - A service for simultaneous searching across scattered life-science databases, ranging from molecular data to patents and literature. * Life Science Database Archive - maintains and stores the datasets generated by life scientists in Japan in a long-term and stable state as national public goods. The Archive makes it easier for many people to search datasets by metadata in a unified format, and to access and download the datasets with clear terms of use. * Taxonomy Icon - A collection of icons (illustrations) of biological species that is free to use and distribute. There are more than 200 icons of various species including Bacteria, Fungi, Protista, Plantae and Animalia. * GenLibi (Gene Linker to bibliography) - an integrated database of human, mouse and rat genes that includes automatically integrated gene, protein, polymorphism, pathway, phenotype, ortholog/protein sequence information, and manually curated gene function and gene-related or co-occurred Disease/Phenotype and bibliography information. * Allie - A search service for abbreviations and long forms utilized in life sciences. It provides a solution to the issue that many abbreviations are used in the literature, and polysemous or synonymous abbreviations appear frequently, making it difficult to read and understand scientific papers that are not relevant to the reader's expertise. * inMeXes - A search service for English expressions (multiple words) that appear no less than 10 times in PubMed/MEDLINE titles or abstracts. In addition, you can easily access the sentences where the expression was used or other related information by clicking one of the search results. * HOWDY - (Human Organized Whole genome Database) is a database system for retrieving human genome information from 14 public databases by using official symbols and aliases. The information is daily updated by extracting data automatically from the genetic databases and shown with all data having the identifiers in common and linking to one another. * MDeR (the MetaData Element Repository in life sciences) - a web-based tool designed to let you search, compare and view Data Elements. MDeR is based on the ISO/IEC 11179 Part3 (Registry metamodel and basic attributes). * Human Genome Variation Database - A database for accumulating all kinds of human genome variations detected by various experimental techniques. * MEDALS - A portal site that provides information about databases, analysis tools, and the relevant projects, that were conducted with the financial support from the Ministry of Economy, Trade and Industry of Japan.

  14. H

    Data from: Could early tweet counts predict later citation counts? A gender...

    • dataverse.harvard.edu
    • search.datacite.org
    Updated Sep 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tahereh Dehdarirad (2020). Could early tweet counts predict later citation counts? A gender study in Life Sciences and Biomedicine (2014-2016) [Dataset]. http://doi.org/10.7910/DVN/GHMV8Q
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 24, 2020
    Dataset provided by
    Harvard Dataverse
    Authors
    Tahereh Dehdarirad
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The DOI list of the documents retrieved from the WOS Medline database in the research area of Life Sciences & Biomedicine from 2014-2016.

  15. o

    Data from: Mayotte rivers: databases used for the development of diatom and...

    • explore.openaire.eu
    • entrepot.recherche.data.gouv.fr
    Updated Jan 1, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Frédéric Rimet; Kalman Tapolczai; Valentin Vasselon; Nathalie Mary; Agnès Bouchez (2019). Mayotte rivers: databases used for the development of diatom and macroinvertebrates water quality tools. [Dataset]. http://doi.org/10.15454/6z5iah
    Explore at:
    Dataset updated
    Jan 1, 2019
    Authors
    Frédéric Rimet; Kalman Tapolczai; Valentin Vasselon; Nathalie Mary; Agnès Bouchez
    Area covered
    Mayotte
    Description

    Données ayant servi au projet "Développement d’outils de bio-indication «phytobenthos» et «macro-invertébrés benthiques» pour les eaux de surface continentales de Mayotte". Ce projet a été financé par l'AFB, Agence Française pour la Biodiversité Les correspondants AFB: Olivier MONNIER et Yorick REYJOL (chargés de mission)

  16. i

    Nucleotide Sequence Database

    • identifiers.org
    • bioregistry.io
    Updated Feb 13, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2014). Nucleotide Sequence Database [Dataset]. http://identifiers.org/%20insdc
    Explore at:
    Dataset updated
    Feb 13, 2014
    Description

    The International Nucleotide Sequence Database Collaboration (INSDC) consists of a joint effort to collect and disseminate databases containing DNA and RNA sequences.

  17. Desirable characteristics for database identifiers in the life sciences.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julie A. McMurry; Nick Juty; Niklas Blomberg; Tony Burdett; Tom Conlin; Nathalie Conte; Mélanie Courtot; John Deck; Michel Dumontier; Donal K. Fellows; Alejandra Gonzalez-Beltran; Philipp Gormanns; Jeffrey Grethe; Janna Hastings; Jean-Karim Hériché; Henning Hermjakob; Jon C. Ison; Rafael C. Jimenez; Simon Jupp; John Kunze; Camille Laibe; Nicolas Le Novère; James Malone; Maria Jesus Martin; Johanna R. McEntyre; Chris Morris; Juha Muilu; Wolfgang Müller; Philippe Rocca-Serra; Susanna-Assunta Sansone; Murat Sariyar; Jacky L. Snoep; Stian Soiland-Reyes; Natalie J. Stanford; Neil Swainston; Nicole Washington; Alan R. Williams; Sarala M. Wimalaratne; Lilly M. Winfree; Katherine Wolstencroft; Carole Goble; Christopher J. Mungall; Melissa A. Haendel; Helen Parkinson (2023). Desirable characteristics for database identifiers in the life sciences. [Dataset]. http://doi.org/10.1371/journal.pbio.2001414.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Julie A. McMurry; Nick Juty; Niklas Blomberg; Tony Burdett; Tom Conlin; Nathalie Conte; Mélanie Courtot; John Deck; Michel Dumontier; Donal K. Fellows; Alejandra Gonzalez-Beltran; Philipp Gormanns; Jeffrey Grethe; Janna Hastings; Jean-Karim Hériché; Henning Hermjakob; Jon C. Ison; Rafael C. Jimenez; Simon Jupp; John Kunze; Camille Laibe; Nicolas Le Novère; James Malone; Maria Jesus Martin; Johanna R. McEntyre; Chris Morris; Juha Muilu; Wolfgang Müller; Philippe Rocca-Serra; Susanna-Assunta Sansone; Murat Sariyar; Jacky L. Snoep; Stian Soiland-Reyes; Natalie J. Stanford; Neil Swainston; Nicole Washington; Alan R. Williams; Sarala M. Wimalaratne; Lilly M. Winfree; Katherine Wolstencroft; Carole Goble; Christopher J. Mungall; Melissa A. Haendel; Helen Parkinson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Desirable characteristics for database identifiers in the life sciences.

  18. m

    Database on ingestion of grit and arthropods by hummingbirds

    • data.mendeley.com
    Updated Nov 25, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Omar Maya García (2020). Database on ingestion of grit and arthropods by hummingbirds [Dataset]. http://doi.org/10.17632/rzhf2bjw4v.1
    Explore at:
    Dataset updated
    Nov 25, 2020
    Authors
    Omar Maya García
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This database contains data on the amount of grit particles and arthropods ingested by six hummingbird species sampled in a seasonal ecosystem in western Mexico. Data were collected from the stomach content analysis of the sampled hummingbirds over a one-year period. Data presented are: 1) the total number of grit particles per stomach, 2) the biomass of arthropods ingested (g dry weight/stomach), 3) the Orders of arthropods ingested, and 4) the average chitin content of arthropods ingested (percentage dry weight).

  19. H

    FAVOR Essential Database

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Apr 12, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hufeng Zhou; Theodore Arapoglou; Xihao Li; Zilin Li; Xihong Lin (2022). FAVOR Essential Database [Dataset]. http://doi.org/10.7910/DVN/1VGTJI
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 12, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Hufeng Zhou; Theodore Arapoglou; Xihao Li; Zilin Li; Xihong Lin
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Functional Annotation of Variants - Online Resource (FAVOR, https://favor.genohub.org) is a comprehensive whole-genome variant annotation database and a variant browser, providing hundreds of functional annotation scores from a variety of aspects of variant biological function. This FAVOR Essential Database is comprised of a collection of essential annotation scores for all possible SNVs (8,812,917,339) and observed indels (79,997,898) in Build GRCh38/hg38, including variant info, chromosome, position, reference allele, alternative allele, aPC-Conservation, aPC-Epigenetics, aPC-Epigenetics-Active, aPC-Epigenetics-Repressed, aPC-Epigenetics-Transcription, aPC-Local-Nucleotide-Diversity, aPC-Mappability, aPC-Mutation-Density, aPC-Protein-Function, aPC-Proximity-To-TSSTES, aPC-Transcription-Factor, CAGE promoter, CAGE, MetaSVM, rsID, FATHMM-XF, Gencode Comprehensive Category, Gencode Comprehensive Info, Gencode Comprehensive Exonic Category, Gencode Comprehensive Exonic Info, GeneHancer, LINSIGHT, CADD, rDHS. These annotation scores can be integrated into FAVORannotator (https://github.com/zhouhufeng/FAVORannotator) to create an annotated GDS (aGDS) file by storing the genotype data and their functional annotation data in an all-in-one file. The aGDS file can then facilitate a wide range of functionally-informed downstream analyses.

  20. B

    Dataset 1: Bilateral Travel Restriction Database v1.0

    • borealisdata.ca
    • dataone.org
    Updated Mar 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Global Strategy Lab (2023). Dataset 1: Bilateral Travel Restriction Database v1.0 [Dataset]. http://doi.org/10.5683/SP2/5E4OA8
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 16, 2023
    Dataset provided by
    Borealis
    Authors
    The Global Strategy Lab
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Earlier this year, Dr. Hoffman and Dr. Fafard published a book chapter on the efficacy and legality of border closures enacted by governments in response to changing COVID-19 conditions. The authors concluded border closures are at best, regarded as powerful symbolic acts taken by governments to show they are acting forcefully, even if the actions lack an epidemiological impact and breach international law. This COVID-19 travel restriction project was developed out of a necessity and desire to further examine the empirical implications of border closures. The current dataset contains bilateral travel restriction information on the status of 179 countries between 1 January 2020 and 8 June 2020. The data was extracted from the ‘international controls’ column from the Oxford COVID-19 Government Response Tracker (OxCGRT). The data in the ‘international controls’ column outlined a country’s change in border control status, as a response to COVID-19 conditions. Accompanying source links were further verified through random selection and comparison with external news sources. Greater weight is given to official national government sources, then to provincial and municipal news-affiliated agencies. The database is presented in matrix form for each country-pair and date. Subsequently, each cell is represented by datum Xdmn and indicates the border closure status on date d by country m on country n. The coding is as follows: no border closure (code = 0), targeted border closure (= 1), and a total border closure (= 99). The dataset provides further details in the ‘notes’ column if the type of closure is a modified form of a targeted closure, either as a land or port closure, flight or visa suspension, or a re-opening of borders to select countries. Visa suspensions and closure of land borders were coded separately as de facto border closures and analyzed as targeted border closures in quantitative analyses. The file titled ‘BTR Supplementary Information’ covers a multitude of supplemental details to the database. The various tabs cover the following: 1) Codebook: variable name, format, source links, and description; 2) Sources, Access dates: dates of access for the individual source links with additional notes; 3) Country groups: breakdown of EEA, EU, SADC, Schengen groups with source links; 4) Newly added sources: for missing countries with a population greater than 1 million (meeting the inclusion criteria), relevant news sources were added for analysis; 5) Corrections: external news sources correcting for errors in the coding of international controls retrieved from the OxCGRT dataset. At the time of our study inception, there was no existing dataset which recorded the bilateral decisions of travel restrictions between countries. We hope this dataset will be useful in the study of the impact of border closures in the COVID-19 pandemic and widen the capabilities of studying border closures on a global scale, due to its interconnected nature and impact, rather than being limited in analysis to a single country or region only. Statement of contributions: Data entry and verification was performed mainly by GL, with assistance from MJP and RN. MP and IW provided further data verification on the nine countries purposively selected for the exploratory analysis of political decision-making.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Maulik Kamdar (2023). Extracted Schemas from the Life Sciences Linked Open Data Cloud [Dataset]. http://doi.org/10.6084/m9.figshare.12402425.v2
Organization logo

Extracted Schemas from the Life Sciences Linked Open Data Cloud

Explore at:
txtAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
Maulik Kamdar
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This dataset is related to the manuscript "An empirical meta-analysis of the life sciences linked open data on the web" published at Nature Scientific Data. If you use the dataset, please cite the manuscript as follows:Kamdar, M.R., Musen, M.A. An empirical meta-analysis of the life sciences linked open data on the web. Sci Data 8, 24 (2021). https://doi.org/10.1038/s41597-021-00797-yWe have extracted schemas from more than 80 publicly available biomedical linked data graphs in the Life Sciences Linked Open Data (LSLOD) cloud into an LSLOD schema graph and conduct an empirical meta-analysis to evaluate the extent of semantic heterogeneity across the LSLOD cloud. The dataset published here contains the following files:- The set of Linked Data Graphs from the LSLOD cloud from which schemas are extracted.- Refined Sets of extracted classes, object properties, data properties, and datatypes, shared across the Linked Data Graphs on LSLOD cloud. Where the schema element is reused from a Linked Open Vocabulary or an ontology, it is explicitly indicated.- The LSLOD Schema Graph, which contains all the above extracted schema elements interlinked with each other based on the underlying content. Sample instances and sample assertions are also provided along with broad level characteristics of the modeled content. The LSLOD Schema Graph is saved as a JSON Pickle File. To read the JSON object in this Pickle file use the Python command as follows:with open('LSLOD-Schema-Graph.json.pickle' , 'rb') as infile: x = pickle.load(infile, encoding='iso-8859-1')Check the Referenced Link for more details on this research, raw data files, and code references.

Search
Clear search
Close search
Google apps
Main menu