Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These files are part of a bioinformatics workshop. The accompanying websites are available at http://sschmeier.github.io/bioinf-workshop/
RNA sequencing (RNA-Seq) is a powerful tool that captures information about how organisms respond to stimuli in their environment at the molecular level. A common RNA-Seq approach involves isolating and sequencing all of the messenger RNA (mRNA) in a tissue sample taken from an organism. Researchers can compare patterns observed in RNA-Seq data to understand how individuals respond to the environment over minutes, hours, or days and how populations evolve in response to the environment over millions of years. The materials in this repository will guide users through an analysis of RNA-Seq data collected from two California populations of a copepod crustacean, Tigriopus californicus, that were exposed to different levels of salinity. Users will examine the contents of a fastq file that contains raw RNA-Seq data, determine the quality of the RNA-Seq data using a web-server, and test for significant differences in gene expression between the copepod populations using the R packages DE...
Advances in high-throughput techniques have resulted in a rising demand for scientists with basic bioinformatics skills as well as workshops and curricula that teach students bioinformatics concepts. DNA Detective is a workshop we designed to introduce students to big data and bioinformatics using CyVerse and the Dolan DNA Learning Center's online DNA Subway platform. DNA Subway is a user-friendly workspace for genome analysis and uses the metaphor of a network of subway lines to familiarize users with the steps involved in annotating and comparing DNA sequences. For DNA Detective, we use the DNA Subway Red Line to guide students through analyzing a "mystery" DNA sequence to distinguish its gene structure and name. During the workshop, students are assigned a unique Arabidopsis thaliana DNA sequence. Students "travel" the Red Line to computationally find and remove sequence repeats, use gene prediction software to identify structural elements of the sequence, search databases of known genes to determine the identity of their mystery sequence, and synthesize these results into a model of their gene. Next, students use The Arabidopsis Information Resource (TAIR) to identify their gene's function so they can hypothesize what a mutant plant lacking that gene might look like (its phenotype). Then, from a group of plants in the room, students select the plant they think is most likely defective for their gene. Through this workshop, students are acquainted to the flow of genetic information from genotype to phenotype and tackle complex genomics analyses in hopes of inspiring and empowering them towards continued science education.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Post-processed FreeBayes variant calls for use in the 2020 bioinformatics workshop for unit CEA301.
These are derived from data published previously in Training material for the course "Exome analysis with GALAXY". Credit for uploading the original data goes to Paolo Uva and Gianmauro Cuccuru!
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Four PBMC scRNA-seq datasets (~1,000 cells each) for analysis with CBW_CAN_SingleCell_*R scripts
MTX files were downloaded from 10X publicly available datasets.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Seven glioblastoma datasets from Richards et al 2021 (Nature Cancer) used in the authors Figure 7, are provided here as raw UMI count matrices (cell barcodes vs. genes) in MTX format. Also code to run a scRNA-seq analysis using those files and cell-level metadata.
This is my first resource. Visit https://dataone.org/datasets/sha256%3A0af80cf07445202b04e935d34961b969a43ea995a5a2f323352ab2ed5915c188 for complete metadata about this dataset.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Workshop Role and Definition.
Program for a bioinformatics workshop organized through the Central African Biodiversity (CAB) Alliance and held at the Université des Sciences et Techniques de Masuku (USTM) in Franceville, Gabon (July 3-8, 2017). Participants and instructors came from Gabon (USTM, the Institut de Recherche en Ecologie Tropicale, the Centre International de Recherches Médicales de Franceville), the U.S. (University of New Orleans), Cameroon (University of Buea) and South Africa (University of Cape Town).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Bioinformatics training events using the hybrid methodology.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Bioinformatics workshop and conferences organized in Pakistan.
This record includes training materials associated with the Australian BioCommons workshop ‘Introduction to Machine Learning in R - from data to knowledge’. This workshop took place over one, 4 hour sessions on 09 December 2024. Event description With the rise in high-throughput sequencing technologies, the volume of omics data has grown exponentially. A major issue is to mine useful knowledge from these heterogeneous collections of data. The analysis of complex high-volume data is not trivial and classical tools cannot be used to explore their full potential. Machine Learning (ML), a discipline in which computers perform automated learning without being programmed explicitly and assist humans to make sense of large and complex data sets, can thus be very useful in mining large omics datasets to uncover new insights that can advance the field of bioinformatics. This hands-on workshop will introduce participants to the ML taxonomy and the applications of common ML algorithms to health data. The workshop will cover the foundational concepts and common methods being used to analyse omics data sets by providing a practical context through the use of basic but widely used R libraries. Participants will acquire an understanding of the standard ML processes, as well as the practical skills in applying them on familiar problems and publicly available real-world data sets. Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event. Lead trainers: Dr Fotis Psomopoulos, Senior Researcher, Institute of Applied Biosciences (INAB), Center for Research and Technology Hellas (CERTH) Facilitators: Dr Giorgia Mori, Australian BioCommons Dr Eden Zhang, Sydney Informatics Hub Dr Erin Graham, Queensland Cyber Infrastructure Foundation (QCIF) Infrastructure provision: Uwe Winter, Australian BioCommons Host: Dr. Giorgia Mori, Australian BioCommons Training materials Files and materials included in this record: Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc. Training materials webpage Data and documentation
RNA expression analysis was performed on the corpus luteum tissue at five time points after prostaglandin F2 alpha treatment of midcycle cows using an Affymetrix Bovine Gene v1 Array. The normalized linear microarray data was uploaded to the NCBI GEO repository (GSE94069). Subsequent statistical analysis determined differentially expressed transcripts ± 1.5-fold change from saline control with P ≤ 0.05. Gene ontology of differentially expressed transcripts was annotated by DAVID and Panther. Physiological characteristics of the study animals are presented in a figure. Bioinformatic analysis by Ingenuity Pathway Analysis was curated, compiled, and presented in tables. A dataset comparison with similar microarray analyses was performed and bioinformatics analysis by Ingenuity Pathway Analysis, DAVID, Panther, and String of differentially expressed genes from each dataset as well as the differentially expressed genes common to all three datasets were curated, compiled, and presented in tables. Finally, a table comparing four bioinformatics tools' predictions of functions associated with genes common to all three datasets is presented. These data have been further analyzed and interpreted in the companion article "Early transcriptome responses of the bovine mid-cycle corpus luteum to prostaglandin F2 alpha includes cytokine signaling". Resources in this dataset:Resource Title: Supporting information as Excel spreadsheets and tables. File Name: Web Page, url: http://www.sciencedirect.com/science/article/pii/S2352340917304031?via=ihub#s0070
This record includes training materials associated with the Australian BioCommons workshop ‘Machine Learning in the Life Sciences’. This on 11 June 2024. Event description Machine learning promises to revolutionise life science research by speeding up data analysis, enabling prediction of biological patterns and modelling complex biological systems. But what exactly is machine learning and when should you use it? This hands-on online workshop provides a high-level introduction to machine learning: what it is, its advantages and disadvantages compared to traditional modelling approaches and the types of scenarios where it may be the right tool for the job. Using example datasets and basic machine learning pipelines we contrast a few commonly used algorithms for constructing predictive models and explore some of their trade-offs. We discuss common pitfalls in how machine learning is applied and evaluated, with a focus on its application in the life sciences, to help you recognise overly optimistic results. We discuss how and why such errors arise and strategies to avoid them. Lead trainer: Dr Benjamin Goudey, Research Fellow, Florey Department of Neuroscience and Mental Health Facilitators: Dr Erin Graham, Queensland Cyber Infrastructure Foundation (QCIF) / James Cook University William Pinzon Perez, Queensland Cyber Infrastructure Foundation (QCIF) Dr Giorgia Mori, Sydney Informatics Hub, University of Sydney Joseph McConnell, University of Adelaide Jessica Chung, Melbourne Bioinformatics 0000-0002-0627-0955 Host: Dr Melissa Burke, Australian BioCommons. Training materials Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event. Files and materials included in this record: Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc. Schedule (PDF): Schedule describing the timing of sessions for the in person and online events Materials shared elsewhere: This workshop follows the Google Colab Notebook developed by Dr Benjamin Goudey: https://github.com/bwgoudey/IntroMLforLifeScienceWorkshopR
This record includes training materials associated with the Australian BioCommons workshop ‘Make your bioinformatics workflows findable and citable’. This workshop took place on 21 March 2023. Event description Computational workflows are invaluable resources for research communities. They help us standardise common analyses, collaborate with other researchers, and support reproducibility. Bioinformatics workflow developers invest significant time and expertise to create, share, and maintain these resources for the benefit of the wider community and being able to easily find and access workflows is an essential factor in their uptake by the community. Increasingly, the research community is turning to workflow registries to find and access public workflows that can be applied to their research. Workflow registries support workflow findability and citation by providing a central repository and allowing users to search for and discover them easily. This workshop will introduce you to workflow registries and support attendees to register their workflows on the popular workflow registry, WorkflowHub. We’ll kick off the workshop with an introduction to the concepts underlying workflow findability, how it can benefit workflow developers, and how you can make the most of workflow registries to share your computational workflows with the research community. You will then have the opportunity to register your own workflows in WorkflowHub with support from our trainers. Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event. Files and materials included in this record: Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc. Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file. 2023-03-21_Workflows_slides (PDF): A copy of the slides presented during the workshop Materials shared elsewhere: A recording of the first part of this workshop is available on the Australian BioCommons YouTube Channel: https://youtu.be/2kGKxaPuQN8
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The 4th German Crop BioGreenformatics Network (GCBN, https://www.denbi.de/gcbn) user training provided a hands-on introduction to useful bioinformatics tools for biologists with little or no previous knowledge. The training enabled biologists to process their own small and large datasets using R and Linux based methods and is entirely computer-based with interspersed lectures. The first part started with an introduction into the use and basic administration (software installation) of the Linux distribution Ubuntu and demonstrated the first steps into the use the R software (trainer Andrea Bräutigam, folder AB). In part two the use of Blast+ in the command line version, of simple Linux commands like 'cut', of Perl scripts and the graphical user interface of the phylogeny tool 'seaview' were demonstrated (trainer Uwe Scholz, folder US). The third session introduced basic concepts and practical tools for processing biological datasets in Linux. In particular, 'awk' and 'sed' were used. Moreover, 'SAMtools' and 'BEDTools' were applied (trainer Martin Mascher, folder MM). In the fourth part 'Introduction to Databases' a quick start guide to use relational databases was presented. By providing easy examples, this lesson set the fundamentals to motivate to use relational database systems as daily bioinformatics tool to store, retrieve and even analyze -omics data in the big data age (trainer Matthias Lange, folder ML). The last part introduced basic statistics to biologist, teach some commonly used statistical methods and demonstrated the creation of graphical visualizations with software R (trainer Yusheng Zhao, folder YZ).
This record includes training materials associated with the Australian BioCommons workshop ���Working with genomics sequences and features in R with Bioconductor���. This workshop took place on 23 September 2021. Workshop description Explore the many useful functions that the Bioconductor environment offers for working with genomic data and other biological sequences. DNA and proteins are often represented as files containing strings of nucleic acids or amino acids. They are associated with text files that provide additional contextual information such as genome annotations. This workshop provides hands-on experience with tools, software and packages available in R via Bioconductor for manipulating, exploring and extracting information from biological sequences and annotation files. We will look at tools for working with some commonly used file formats including FASTA, GFF3, GTF, methods for identifying regions of interest, and easy methods for obtaining data packages such as genome assemblies. This workshop is presented by the Australian BioCommons and Monash Bioinformatics Platform with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative. Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event. Files and materials included in this record: Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc. Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file. Schedule (PDF): schedule for the workshop providing a breakdown of topics and timings Materials shared elsewhere: This workshop follows the tutorial ���Working with DNA sequences and features in R with Bioconductor - version 2��� developed for Monash Bioinformatics Platform and Monash Data Fluency by Paul Harrison. https://monashdatafluency.github.io/r-bioc-2/ Version 2: Grammatical and typographic errors were corrected in the Event Metadata file.
This record includes training materials associated with the Australian BioCommons workshop 'Single cell RNAseq analysis in R'. This workshop took place over two, 3.5 hour sessions on 26 and 27 October 2023. Event description Analysis and interpretation of single cell RNAseq (scRNAseq) data requires dedicated workflows. In this hands-on workshop we will show you how to perform single cell analysis using Seurat - an R package for QC, analysis, and exploration of single-cell RNAseq data. We will discuss the 'why' behind each step and cover reading in the count data, quality control, filtering, normalisation, clustering, UMAP layout and identification of cluster markers. We will also explore various ways of visualising single cell expression data. This workshop is presented by the Australian BioCommons, Queensland Cyber Infrastructure Foundation (QCIF) and the Monash Genomics and Bioinformatics Platform with the assistance of a network of facilitators from the national Bioinformatics Training Cooperative. Lead trainers: Sarah Williams, Adele Barugahare, Paul Harrison, Laura Perlaza Jimenez Facilitators: Nick Matigan, Valentine Murigneux, Magdalena (Magda) Antczak Infrastructure provision: Uwe Winter Coordinator: Melissa Burke Training materials Materials are shared under a Creative Commons Attribution 4.0 International agreement unless otherwise specified and were current at the time of the event. Files and materials included in this record: Event metadata (PDF): Information about the event including, description, event URL, learning objectives, prerequisites, technical requirements etc. Index of training materials (PDF): List and description of all materials associated with this event including the name, format, location and a brief description of each file. scRNAseq_Schedule (PDF): A breakdown of the topics and timings for the workshop Materials shared elsewhere: This workshop follows the tutorial 'scRNAseq Analysis in R with Seurat' https://swbioinf.github.io/scRNAseqInR_Doco/index.html Slides used to introduce key topics are available via GitHub https://github.com/swbioinf/scRNAseqInR_Doco/tree/main/slides This material is based on the introductory Guided Clustering Tutorial tutorial from Seurat. It is also drawing from a similar workshop held by Monash Bioinformatics Platform Single-Cell-Workshop, with material here.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Epidemics of emerging and re-emerging infectious diseases are a danger to civilian and military populations worldwide. Health security and mitigation of infectious disease threats is a priority of the United States Government and the Department of Defense (DoD). Next generation sequencing (NGS) and Bioinformatics (BI) enhances traditional biosurveillance by providing additional data to understand transmission, identify resistance and virulence factors, make predictions, and update risk assessments. As more and more laboratories adopt NGS and BI technologies they encounter challenges in building local capacity. In addition to choosing the right sequencing platform and approach, considerations must also be made for the complexity of bioinformatics analyses, data storage, as well as personnel and computational requirements. To address these needs, a comprehensive training program was developed covering wet lab and bioinformatics approaches to NGS. The program is meant to be modular and adaptive to meet both common and individualized needs of medical research and public health laboratories across the DoD. The training program was first deployed internationally to the Basic Science Laboratory of the US Army Medical Research Directorate-Africa in Kisumu, Kenya, which is an overseas Lab of the Walter Reed Army Institute of Research (WRAIR). A week-long workshop with intensive focus on targeted sequencing and the bioinformatics of genome assembly (n = 24 participants) was held. Post-workshop self-assessment (completed by 21 participants) noted significant median gains in knowledge domains related to NGS targeted sequencing, bioinformatics for genome assembly, and sequence quality assessment. The participants also reported that the information on study design, sample preparation, sequencing quality control, data quality assessment, reporting, and basic and advanced bioinformatics analysis were the most useful information presented in the training. While longer-term evaluations are planned, the training resulted in significant short-term improvement of a laboratory’s self-reported wet lab and bioinformatics capabilities. This framework can be used for future DoD laboratory development in the area of NGS and BI for infectious disease surveillance, ultimately enhancing this global DoD capability.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This training dataset is from 2 imaginary microbiome samples. Each one is from a paired end 16S amplicon sequencing run and contains 2 fastq files (forwards and reverse.)
It is a useful dataset for demonstrating:
16S metagenomics analysis techniques
Differences between microbiome samples
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These files are part of a bioinformatics workshop. The accompanying websites are available at http://sschmeier.github.io/bioinf-workshop/