100+ datasets found
  1. Source data file for Nature communications

    • figshare.com
    xlsx
    Updated Mar 11, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiashun Miao (2021). Source data file for Nature communications [Dataset]. http://doi.org/10.6084/m9.figshare.14196938.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 11, 2021
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Jiashun Miao
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Source data files underlying figure 2-4 in our paper

  2. r

    Nature Biotechnology FAQ - ResearchHelpDesk

    • researchhelpdesk.org
    Updated May 25, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Research Help Desk (2022). Nature Biotechnology FAQ - ResearchHelpDesk [Dataset]. https://www.researchhelpdesk.org/journal/faq/186/nature-biotechnology
    Explore at:
    Dataset updated
    May 25, 2022
    Dataset authored and provided by
    Research Help Desk
    Description

    Nature Biotechnology FAQ - ResearchHelpDesk - Nature Biotechnology is interested in the best research from across the field of Biotechnology; our broad scope ensures that work published reaches the widest possible audience. All editorial decisions are made by a team of full-time professional editors. Nature Biotechnology is a monthly journal covering the science and business of biotechnology. It publishes new concepts in technology/methodology of relevance to the biological, biomedical, agricultural and environmental sciences as well as covers the commercial, political, ethical, legal, and societal aspects of this research. The first function is fulfilled by the peer-reviewed research section, the second by the expository efforts in the front of the journal. We provide researchers with news about business; we provide the business community with news about research developments. The core areas in which we are actively seeking research papers include: molecular engineering of nucleic acids and proteins; molecular therapy (therapeutics genes, antisense, siRNAs, aptamers, DNAzymes, ribozymes, peptides, proteins); large-scale biology (genomics, functional genomics, proteomics, structural genomics, metabolomics, etc.); computational biology (algorithms and modeling), regenerative medicine (stem cells, tissue engineering, biomaterials); imaging technology; analytical biotechnology (sensors/detectors for analytes/macromolecules), applied immunology (antibody engineering, xenotransplantation, T-cell therapies); food and agricultural biotechnology; and environmental biotechnology. A comprehensive list of areas of interest is shown below. Strategies for controlling gene expression Strategies for manipulating gene structure Strategies for gene containment Technologies for analyzing gene function (e.g., arrays, SAGE) Technologies for analyzing gene structure/organization (e.g., molecular beacons) Chemogenomics or chemical genetics Pharmacogenomics/SNPs Computational analysis Technologies for analyzing/identifying protein structure/function (e.g., 2-D gels, mass spectrometry, yeast two-hybrid, SPR, NMR, arrays and chips) Structural genomics Computational analysis Technologies for analyzing/profiling metabolites (chromatography, mass spectrometry) Computational analysis Bioinformatics; algorithms; data deconvolution Modeling and systems biology: kinetics-based models and constraints-based models Rational approaches for proteins/antibodies/enzymes/drugs Molecular evolution Molecular breeding approaches Genetic manipulation of species of interest to modify or allow the production of a commercially or therapeutically relevant compound Computational analysis Mammalian cells Insect cells Bacteria Fungi Plant cells Targeting strategies Viral and nonviral vector strategies Reporter molecules Imaging approaches/technologies for visualizing whole animals, cells, or single molecules Computational analysis Gene therapy (targeting, expression, integration, immunogenicity) Antisense RNAi DNAzymes and ribozymes Nanomaterials for use in drug delivery or as therapeutics Nanomaterials for use in industrial biotechnology Nanosensors Nanosystems for imaging molecules and cells Antibody engineering T-cell therapies Therapies exploiting innate immunity (e.g. complement) Antigen delivery vectors and approaches Nucleic acid vaccines Computational analysis Stem cells Tissue engineering Therapeutic cloning (somatic cell nuclear transfer) Xenotransplantation Biomaterials Approaches for detecting biological molecules Use of biological systems in detecting analytes Approaches for multiplexing and increasing throughput Selection/screening strategies for gene/proteins/drugs Microfluidics Engineering materials for biological application Molecular imprinting Biomimetics Nanotechnology Crop improvement (resistance to stress, disease, pests) Nutraceuticals Forest biotechnology Plant vaccines Plants as bioreactors Gene-containment strategies Transgenic animals Knockouts Reproductive cloning Biopharmaceutical and enzyme production Transgene targeting and expression strategies Bioremediation Biomining Phytoremediation Monitoring

  3. Data from: Social media images from Peneda-Gerês National Park, Northern...

    • zenodo.org
    • produccioncientifica.ugr.es
    Updated Jun 23, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ana Sofia Cardoso; Ana Sofia Cardoso; Ricardo Moreno-Llorca; Ricardo Moreno-Llorca; Domingo Alcaraz-Segura; Domingo Alcaraz-Segura; Ana Sofia Vaz; Ana Sofia Vaz (2021). Social media images from Peneda-Gerês National Park, Northern Portugal [Dataset]. http://doi.org/10.5281/zenodo.5008842
    Explore at:
    Dataset updated
    Jun 23, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Ana Sofia Cardoso; Ana Sofia Cardoso; Ricardo Moreno-Llorca; Ricardo Moreno-Llorca; Domingo Alcaraz-Segura; Domingo Alcaraz-Segura; Ana Sofia Vaz; Ana Sofia Vaz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Portugal, Norte Region
    Description

    This dataset contains images from the Peneda-Gerês National Park, Northern Portugal. The images were collected from the Flickr and Wikiloc platforms considering a time period from 2003 to 2017. In respect to the General Data Protection Regulation 2016/679, social media data protected by users’ rights was not downloaded nor analysed. Public data that would potentially contain personal information from social media users was kept anonymous through the study. Data was retrieved through the use of the freely available Flickr’s Application Programming Interface (API), indicating a time window and a bounding box with a pair of coordinates (in our case: minimum latitude: 41.653104; maximum lat.: 42.083595; min. longitude: -8.426270; max. lon.: -7.754076) around Peneda-Gerês. This information was then saved as an excel file with the following attributes: user-id, date taken, latitude, longitude, picture uniform resource locator (url).

    A first annotation, in the context of cultural ecosystem services (CES), was performed by dividing the photographs of the dataset into “Indoor” and “Outdoor” classes. Only the “Outdoor” pictures were included in this study, since CES are directly connected to nature and environment, which in turn are related to the outside/outdoor. The “Outdoor” images were further divided into two main classes, “Natural” and “Human”, depending on whether the image was dominated by natural or man-made elements. Lastly, a finer annotation for outdoor images was also provided, which encompasses the following six classes: “Species”, “Landscape”, “Nature”, “Human activities”, “Human structures” and “Posing”. “Species” pictures respectively pertained to close-up shots of animals or plants in the wild, translating CES pertaining to biodiversity appreciation. “Landscape” pictures show wide-open shots of nature in the wild, often with a visible horizon most often representing people’s enjoyment of landscape aesthetics. “Human activities” include pictures where people engage in by recreational activities, for instance related to sports such as ski or cycling. “Human structures” include those pictures where man-made structures dominate in the wild, e.g., historical monuments and churches, capturing situations of cultural heritage and spiritual enrichment. “Posing” refers to pictures with people looking at the camera, with recognizable faces, testifying social enjoyment and sense of identity. Finally, “Nature” pictures capture natural elements with no particular feature (such as species) but with an intermediate shot (differing from wide-open shots attributed to landscapes), expressing the appreciation of nature by people.

  4. f

    Spatial omics drosophila embryo

    • figshare.com
    hdf
    Updated Oct 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Giovanni Palla (2022). Spatial omics drosophila embryo [Dataset]. http://doi.org/10.6084/m9.figshare.21395247.v1
    Explore at:
    hdfAvailable download formats
    Dataset updated
    Oct 25, 2022
    Dataset provided by
    figshare
    Authors
    Giovanni Palla
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    spatial gene expression of drosophila embryo from: https://www.nature.com/articles/s41592-022-01480-9 and originally https://www.nature.com/articles/s41586-019-1773-3

  5. r

    Nature Biotechnology Impact Factor 2024-2025 - ResearchHelpDesk

    • researchhelpdesk.org
    Updated Feb 23, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Research Help Desk (2022). Nature Biotechnology Impact Factor 2024-2025 - ResearchHelpDesk [Dataset]. https://www.researchhelpdesk.org/journal/impact-factor-if/186/nature-biotechnology
    Explore at:
    Dataset updated
    Feb 23, 2022
    Dataset authored and provided by
    Research Help Desk
    Description

    Nature Biotechnology Impact Factor 2024-2025 - ResearchHelpDesk - Nature Biotechnology is interested in the best research from across the field of Biotechnology; our broad scope ensures that work published reaches the widest possible audience. All editorial decisions are made by a team of full-time professional editors. Nature Biotechnology is a monthly journal covering the science and business of biotechnology. It publishes new concepts in technology/methodology of relevance to the biological, biomedical, agricultural and environmental sciences as well as covers the commercial, political, ethical, legal, and societal aspects of this research. The first function is fulfilled by the peer-reviewed research section, the second by the expository efforts in the front of the journal. We provide researchers with news about business; we provide the business community with news about research developments. The core areas in which we are actively seeking research papers include: molecular engineering of nucleic acids and proteins; molecular therapy (therapeutics genes, antisense, siRNAs, aptamers, DNAzymes, ribozymes, peptides, proteins); large-scale biology (genomics, functional genomics, proteomics, structural genomics, metabolomics, etc.); computational biology (algorithms and modeling), regenerative medicine (stem cells, tissue engineering, biomaterials); imaging technology; analytical biotechnology (sensors/detectors for analytes/macromolecules), applied immunology (antibody engineering, xenotransplantation, T-cell therapies); food and agricultural biotechnology; and environmental biotechnology. A comprehensive list of areas of interest is shown below. Strategies for controlling gene expression Strategies for manipulating gene structure Strategies for gene containment Technologies for analyzing gene function (e.g., arrays, SAGE) Technologies for analyzing gene structure/organization (e.g., molecular beacons) Chemogenomics or chemical genetics Pharmacogenomics/SNPs Computational analysis Technologies for analyzing/identifying protein structure/function (e.g., 2-D gels, mass spectrometry, yeast two-hybrid, SPR, NMR, arrays and chips) Structural genomics Computational analysis Technologies for analyzing/profiling metabolites (chromatography, mass spectrometry) Computational analysis Bioinformatics; algorithms; data deconvolution Modeling and systems biology: kinetics-based models and constraints-based models Rational approaches for proteins/antibodies/enzymes/drugs Molecular evolution Molecular breeding approaches Genetic manipulation of species of interest to modify or allow the production of a commercially or therapeutically relevant compound Computational analysis Mammalian cells Insect cells Bacteria Fungi Plant cells Targeting strategies Viral and nonviral vector strategies Reporter molecules Imaging approaches/technologies for visualizing whole animals, cells, or single molecules Computational analysis Gene therapy (targeting, expression, integration, immunogenicity) Antisense RNAi DNAzymes and ribozymes Nanomaterials for use in drug delivery or as therapeutics Nanomaterials for use in industrial biotechnology Nanosensors Nanosystems for imaging molecules and cells Antibody engineering T-cell therapies Therapies exploiting innate immunity (e.g. complement) Antigen delivery vectors and approaches Nucleic acid vaccines Computational analysis Stem cells Tissue engineering Therapeutic cloning (somatic cell nuclear transfer) Xenotransplantation Biomaterials Approaches for detecting biological molecules Use of biological systems in detecting analytes Approaches for multiplexing and increasing throughput Selection/screening strategies for gene/proteins/drugs Microfluidics Engineering materials for biological application Molecular imprinting Biomimetics Nanotechnology Crop improvement (resistance to stress, disease, pests) Nutraceuticals Forest biotechnology Plant vaccines Plants as bioreactors Gene-containment strategies Transgenic animals Knockouts Reproductive cloning Biopharmaceutical and enzyme production Transgene targeting and expression strategies Bioremediation Biomining Phytoremediation Monitoring

  6. 🧫 Promoter or not? - Bioinformatics 🗃️ Dataset

    • kaggle.com
    Updated Mar 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samira Shemirani (2024). 🧫 Promoter or not? - Bioinformatics 🗃️ Dataset [Dataset]. https://www.kaggle.com/datasets/samira1992/promoter-or-not-bioinformatics-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 31, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Samira Shemirani
    Description

    The promoter region is located near the transcription start sites, which regulate the transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, recognition of the promoter region is an important area of interest in the field of bioinformatics. Over the past years, many new promoter prediction programs (PPPs) have emerged. PPPs aim to identify promoter regions in a genome using computational methods. Promoter prediction is a supervised learning problem that consists of three main steps to extract features: 1) CpG islands 2) Structural features 3) Content features

  7. m

    Fukushima and Pollock, Supplementary Data

    • data.mendeley.com
    Updated Nov 16, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kenji Fukushima (2020). Fukushima and Pollock, Supplementary Data [Dataset]. http://doi.org/10.17632/3vcstwdbrn.2
    Explore at:
    Dataset updated
    Nov 16, 2020
    Authors
    Kenji Fukushima
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplementary Data for Fukushima and Pollock (2020, Nature Communications 11:4459) https://www.nature.com/articles/s41467-020-18090-8

  8. Single-cell datasets for distribution-based sketching

    • zenodo.org
    • explore.openaire.eu
    bin
    Updated May 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vishal Athreya Baskaran; Jolene Ranek; Siyuan Shan; Natalie Stanley; Junier Oliva; Vishal Athreya Baskaran; Jolene Ranek; Siyuan Shan; Natalie Stanley; Junier Oliva (2022). Single-cell datasets for distribution-based sketching [Dataset]. http://doi.org/10.5281/zenodo.6546964
    Explore at:
    binAvailable download formats
    Dataset updated
    May 14, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Vishal Athreya Baskaran; Jolene Ranek; Siyuan Shan; Natalie Stanley; Junier Oliva; Vishal Athreya Baskaran; Jolene Ranek; Siyuan Shan; Natalie Stanley; Junier Oliva
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Contains preprocessed single-cell data for sketching single-cell samples. Preprocessed adata objects can be accessed using the 'read_h5ad' function in Scanpy.

  9. d

    Datasets for manuscript: The nature, representation and measure of genetic...

    • search.dataone.org
    • datadryad.org
    • +1more
    Updated May 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Steinar Thorvaldsen (2025). Datasets for manuscript: The nature, representation and measure of genetic information [Dataset]. http://doi.org/10.5061/dryad.h44j0zpn6
    Explore at:
    Dataset updated
    May 21, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Steinar Thorvaldsen
    Time period covered
    Jan 1, 2022
    Description

    Current studies in genetics very often refer to notions from information science. The concept of genetic information is still disputed because it attributes semantic traits to what seem to be regular biochemical entities. Some researchers maintain that the use of information in biology is just metaphorical and maybe even misleading. In this paper, we offer an analysis of the nature and characteristics of the use of information in proteins, protein families, and their sequences. It is argued that the foundation of the metaphorical view is relatively weak given the current findings in bioinformatics, and it is shown that the present understanding of genetics fits well into the context of the modern philosophy of information. Here, we propose an extension of Floridi’s conceptual model of information to include genetic information better. In addition, we discuss how to understand the qualitative aspects of genetic information and how to measure its quantitative aspects and present a joint s...

  10. Haber regions

    • figshare.com
    hdf
    Updated May 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lukas Heumos (2025). Haber regions [Dataset]. http://doi.org/10.6084/m9.figshare.21533982.v2
    Explore at:
    hdfAvailable download formats
    Dataset updated
    May 5, 2025
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Lukas Heumos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A subset of the Haber dataset that only contains the regions. It explicitly does not include the large cells.

    https://www.nature.com/articles/nature24489

  11. Supplementary material 8 from: Horn T (2016) Integrating Biodiversity Data...

    • zenodo.org
    csv
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thomas Horn; Thomas Horn (2020). Supplementary material 8 from: Horn T (2016) Integrating Biodiversity Data into Botanic Collections. Biodiversity Data Journal 4: e7971. https://doi.org/10.3897/BDJ.4.e7971 [Dataset]. http://doi.org/10.3897/bdj.4.e7971.suppl8
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Thomas Horn; Thomas Horn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    List of 8383 taxon names with the number of public records in BOLD. The column "type" indicates if the original name was found at BOLD or if an alternative name from TPL (TPLsynonym or TPLaccepted) was found at BOLD.

  12. m

    Inter-residue distances surrounding the ligand data sets using MANORAA

    • data.mendeley.com
    • narcis.nl
    Updated Sep 22, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Duangrudee Tanramluk (2021). Inter-residue distances surrounding the ligand data sets using MANORAA [Dataset]. http://doi.org/10.17632/4z4mypck9b.3
    Explore at:
    Dataset updated
    Sep 22, 2021
    Authors
    Duangrudee Tanramluk
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Distances measured between distinctive parts of amino acid residues surrounding the ligand.

  13. r

    Nature Reviews Molecular Cell Biology CiteScore 2024-2025 - ResearchHelpDesk...

    • researchhelpdesk.org
    Updated May 6, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Research Help Desk (2022). Nature Reviews Molecular Cell Biology CiteScore 2024-2025 - ResearchHelpDesk [Dataset]. https://www.researchhelpdesk.org/journal/sjr/605/nature-reviews-molecular-cell-biology
    Explore at:
    Dataset updated
    May 6, 2022
    Dataset authored and provided by
    Research Help Desk
    Description

    Nature Reviews Molecular Cell Biology CiteScore 2024-2025 - ResearchHelpDesk - Molecular cell biology is a marriage of two distinct, yet complementary, disciplines. In its traditional sense, the term 'molecular biology' refers to the study of the macromolecules essential to life — nucleic acids and proteins. The field of cell biology is a natural extension of this, integrating what we know at the molecular level into an understanding of processes and interactions at the cellular level. Only by combining both fields can we paint a broad picture of essential biological processes such as how cells divide, grow, communicate and die. Nature Reviews Molecular Cell Biology features Reviews, Perspective articles and Comments on a broad range of topics, and highlights important primary papers and technological progress. Reviews, Perspectives and Comments are commissioned by the editorial team. The scope of the journal includes: Cell signalling (signalling networks, ion channels, gap junctions) Membrane dynamics (membrane organization, endocytosis, exocytosis, organelle biogenesis) Cell adhesion (adhesion molecules, extracellular matrix) Cytoskeletal dynamics (cell motility, molecular motors, actin, microtubules, intermediate filaments) Developmental and stem cell biology Cell growth and division (cell cycle, cytokinesis, cancer) Cell death (apoptosis, necrosis, autophagy, ageing) Cellular microbiology (host–pathogen interactions) Plant cell biology Gene expression (transcription, splicing, RNA stability, translation, RNA interference, circadian rhythms) Nucleic-acid metabolism (DNA repair, recombination and replication, RNA biogenesis) Chromosome biology and nuclear architecture (chromatin, chromosome structure, transposons) Nuclear transport (import and export of molecules to and from the nucleus) Protein structure and metabolism (structure-function relationships, quality control, post-translational modifications, folding, translocation, degradation) Bioenergetics (respiration, photosynthesis, organelle biochemistry) Technology and techniques (imaging, proteomics, systems biology, bioinformatics)

  14. 🔬 Essential Proteins - Bioinformatics 🗃️ Dataset

    • kaggle.com
    Updated Oct 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samira Shemirani (2023). 🔬 Essential Proteins - Bioinformatics 🗃️ Dataset [Dataset]. https://www.kaggle.com/samira1992/essential-proteins-bioinformatics-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 19, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Samira Shemirani
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Essential proteins are vital for the life and reproduction of organisms and play a crucial role in maintaining cellular functions. If the destruction of a certain protein would lead to lethality or infertility, it can be classified as essential to an organism, meaning the organism cannot survive without it. Compared to non-essential proteins, essential proteins are more likely to persist in biological evolution. For instance, essential proteins make excellent targets for the development of new potential drugs and vaccines aimed at treating and preventing diseases.

    With the advent of high-throughput technologies, such as the yeast two-hybrid system and mass spectrometry analysis, various protein-protein interaction (PPI) data become available, facilitating the study of essential proteins at the network level.

  15. [Dataset] Data for the course "Population Genomics" at Aarhus University

    • zenodo.org
    application/gzip, bin
    Updated Jan 8, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuele Soraggi; Samuele Soraggi; Kasper Munch; Kasper Munch (2025). [Dataset] Data for the course "Population Genomics" at Aarhus University [Dataset]. http://doi.org/10.5281/zenodo.7670839
    Explore at:
    application/gzip, binAvailable download formats
    Dataset updated
    Jan 8, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Samuele Soraggi; Samuele Soraggi; Kasper Munch; Kasper Munch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Datasets, conda environments and Softwares for the course "Population Genomics" of Prof Kasper Munch. This course material is maintained by the health data science sandbox. This webpage shows the latest version of the course material.

    1. Data.tar.gz Contains the datasets and executable files for some of the softwares
      You can unpack by simply doing
      tar -zxf Data.tar.gz -C ./
      This will create a folder called Data with the uncompressed material inside
    2. Course_Env.packed.tar.gz Contains the conda environment used for the course. This needs to be unpacked to adjust all the prefixes (Note this environment is created on Ubuntu 22.10). You do this in the command line by
      1. creating the folder Course_Env: mkdir Course_Env
      2. untar the file: tar -zxf Course_Env.packed.tar.gz -C Course_Env
      3. Activate the environment: conda activate ./Course_Env
      4. Run the unpacking script (it can take quite some time to get it done): conda-unpack
    3. Course_Env.unpacked.tar.gz The same environment as above, but will work only if untarred into the folder /usr/Material - so use the version above if you are using it in another folder. This file is mostly to execute the course in our own cloud environment.
    4. environment_with_args.yml The file needed to generate the conda environment. Create and activate the environment with the following commands:
      1. conda env create -f environment_with_args.yml -p ./Course_Env
      2. conda activate ./Course_Env

    The data is connected to the following repository: https://github.com/hds-sandbox/Popgen_course_aarhus. The original course material from Prof Kasper Munch is at https://github.com/kaspermunch/PopulationGenomicsCourse.

    Description

    The participants will after the course have detailed knowledge of the methods and applications required to perform a typical population genomic study.

    The participants must at the end of the course be able to:

    • Identify an experimental platform relevant to a population genomic analysis.
    • Apply commonly used population genomic methods.
    • Explain the theory behind common population genomic methods.
    • Reflect on strengths and limitations of population genomic methods.
    • Interpret and analyze results of population genomic inference.
    • Formulate population genetics hypotheses based on data

    The course introduces key concepts in population genomics from generation of population genetic data sets to the most common population genetic analyses and association studies. The first part of the course focuses on generation of population genetic data sets. The second part introduces the most common population genetic analyses and their theoretical background. Here topics include analysis of demography, population structure, recombination and selection. The last part of the course focus on applications of population genetic data sets for association studies in relation to human health.

    Curriculum

    The curriculum for each week is listed below. "Coop" refers to a set of lecture notes by Graham Coop that we will use throughout the course.

    Course plan

    1. Course intro and overview:
    2. Drift and the coalescent:
    3. Recombination:
    4. Population strucure and incomplete lineage sorting:
    5. Hidden Markov models:
    6. Ancestral recombination graphs:
    7. Past population demography:
    8. Direct and linked selection:
    9. Admixture:
    10. Genome-wide association study (GWAS):
    11. Heritability:
      • Lecture: Coop Lecture notes Sec. 2.2 (p23-36) + Chap. 7 (p119-142)
      • Exercise: Association testing
    12. Evolution and disease:
      • Lecture: Coop Lecture notes Sec. 11.0.1 (p217-221)
      • Exercise: Estimating heritability
  16. m

    BioEmergences-wt4

    • data.mendeley.com
    Updated Apr 28, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David Pastor-Escuredo (2022). BioEmergences-wt4 [Dataset]. http://doi.org/10.17632/p9hwtxtnfy.1
    Explore at:
    Dataset updated
    Apr 28, 2022
    Authors
    David Pastor-Escuredo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BioEmergences-wt4: Zebrafish wild-type embryo during Gastrulation. Raw data 3D+time: nuclei + membrane channels (VTK). Digital cell lineage (computed with BioEmergences workflow https://www.nature.com/articles/ncomms9674).

  17. Data from: Inference of selective force on house mice genomes during...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    bin, zip
    Updated Mar 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kazumichi Fujiwara; Kazumichi Fujiwara; Naoki Osada; Hitoshi Suzuki; Naoki Osada; Hitoshi Suzuki (2024). Data from: Inference of selective force on house mice genomes during secondary contact in East Asia [Dataset]. http://doi.org/10.5061/dryad.9p8cz8wnb
    Explore at:
    bin, zipAvailable download formats
    Dataset updated
    Mar 12, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Kazumichi Fujiwara; Kazumichi Fujiwara; Naoki Osada; Hitoshi Suzuki; Naoki Osada; Hitoshi Suzuki
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The house mouse (Mus musculus), commensal to humans, has spread globally via human activities, leading to secondary contact between genetically divergent subspecies. This pattern of genetic admixture can provide insights into the selective forces at play in this well-studied model organism. Our analysis of 163 house mouse genomes, mainly from East Asia, revealed substantial admixture between the subspecies castaneus and musculus, particularly in Japan and southern China. We revealed, despite the admixture, that all Y chromosomes in the East Asian samples belonged to the musculus-type haplogroup, potentially explained by genomic conflict under sex ratio distortion due to varying copy numbers of ampliconic genes on sex chromosomes. We also investigated the influence of natural selection on the post-hybridization of the subspecies castaneus and musculus in Japan. Even though the genetic background of most Japanese samples closely resembles the subspecies musculus, certain genomic regions overrepresented the castaneus-like genetic components, particularly in immune-related genes. Furthermore, a large genomic block containing a vomeronasal/olfactory receptor gene cluster predominantly harbored castaneus-type haplotypes in the Japanese samples, highlighting the possible role of olfaction-based recognition in shaping hybrid genomes.

  18. CERES - Meyers, Bryan, et al. Nature Genetics. 2017.

    • figshare.com
    txt
    Updated Oct 30, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Robin Meyers (2017). CERES - Meyers, Bryan, et al. Nature Genetics. 2017. [Dataset]. http://doi.org/10.6084/m9.figshare.5319388.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Oct 30, 2017
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Robin Meyers
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Datasets used in CERES publication.Meyers, Bryan, et al. Computational correction of copy-number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells. Nature Genetics. 2017.External data:CCLE_copynumber_2013-12-03.seg.txtCCLE copy number data downloaded from https://data.broadinstitute.org/ccle_legacy_data/dna_copy_number/CCLE_copynumber_2013-12-03.seg.txtCCLE_RNAseq_081117.rpkm.gctCCLE gene expression data downloaded from https://data.broadinstitute.org/ccle/CCLE_RNAseq_081117.rpkm.gctccle2maf_081117.txtCCLE mutation data downloaded from https://data.broadinstitute.org/ccle/ccle2maf_081117.txtAchilles_v2.20.2_GeneSolutions.gctRNAi DEMETER gene dependencies downloaded from https://portals.broadinstitute.org/achilles/datasets/12/download/Achilles_v2.20.2_GeneSolutions.gctCCDS.current.txtCCDS gene annotations downloaded from ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/archive/15/CCDS.current.txtc2.all.v6.0.symbols.gmt MSigDB genesets downloaded from http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/6.0/c2.all.v6.0.symbols.gmtavana_rs2.txt and gecko_rs2.txtDoench-Root scores for sgRNAs provided by authors of Doench, et al. Nat. Biotechnol. 2016.

    wang_pool_normalized.counts.txtDownloaded from supplement of Wang, et al. Cell. 2017.

  19. Supplementary material 7 from: Horn T (2016) Integrating Biodiversity Data...

    • zenodo.org
    csv
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thomas Horn; Thomas Horn (2020). Supplementary material 7 from: Horn T (2016) Integrating Biodiversity Data into Botanic Collections. Biodiversity Data Journal 4: e7971. https://doi.org/10.3897/BDJ.4.e7971 [Dataset]. http://doi.org/10.3897/bdj.4.e7971.suppl7
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Thomas Horn; Thomas Horn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    All exotic and invasive taxa detected using DAISIE

  20. Data from: Paired omics Data Platform projects

    • zenodo.org
    • doi.org
    • +3more
    zip
    Updated Jan 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stefan Verhoeven; Stefan Verhoeven; Michelle Schorn; Marnix H. Medema; Marnix H. Medema; Pieter C. Dorrestein; Pieter C. Dorrestein; Justin J.J. van der Hooft; Justin J.J. van der Hooft; Michelle Schorn (2023). Paired omics Data Platform projects [Dataset]. http://doi.org/10.5281/zenodo.3736431
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 1, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Stefan Verhoeven; Stefan Verhoeven; Michelle Schorn; Marnix H. Medema; Marnix H. Medema; Pieter C. Dorrestein; Pieter C. Dorrestein; Justin J.J. van der Hooft; Justin J.J. van der Hooft; Michelle Schorn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Paired Omics Data Platform is a community-based initiative standardizing links between genomic and metabolomics data in a computer readable format to further the field of natural products discovery. The goals are to link molecules to their producers, find large scale genome-metabolome associations, use genomic data to assist in structural elucidation of molecules, and provide a centralized database for paired datasets. This dataset contains the projects in http://pairedomicsdata.bioinformatics.nl/.

    The JSON documents adhere to the http://pairedomicsdata.bioinformatics.nl/schema.json JSON schema.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jiashun Miao (2021). Source data file for Nature communications [Dataset]. http://doi.org/10.6084/m9.figshare.14196938.v1
Organization logo

Source data file for Nature communications

Explore at:
xlsxAvailable download formats
Dataset updated
Mar 11, 2021
Dataset provided by
Figsharehttp://figshare.com/
Authors
Jiashun Miao
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Source data files underlying figure 2-4 in our paper

Search
Clear search
Close search
Google apps
Main menu