100+ datasets found
  1. B

    Open Data Training Workshop: Case Studies in Open Data for Qualitative and...

    • borealisdata.ca
    • search.dataone.org
    Updated Apr 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Srinvivas Murthy; Maggie Woo Kinshella; Jessica Trawin; Teresa Johnson; Niranjan Kissoon; Matthew Wiens; Gina Ogilvie; Gurm Dhugga; J Mark Ansermino (2023). Open Data Training Workshop: Case Studies in Open Data for Qualitative and Quantitative Clinical Research [Dataset]. http://doi.org/10.5683/SP3/BNNAE7
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 18, 2023
    Dataset provided by
    Borealis
    Authors
    Srinvivas Murthy; Maggie Woo Kinshella; Jessica Trawin; Teresa Johnson; Niranjan Kissoon; Matthew Wiens; Gina Ogilvie; Gurm Dhugga; J Mark Ansermino
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Dataset funded by
    Digital Research Alliance of Canada
    Description

    Objective(s): Momentum for open access to research is growing. Funding agencies and publishers are increasingly requiring researchers make their data and research outputs open and publicly available. However, clinical researchers struggle to find real-world examples of Open Data sharing. The aim of this 1 hr virtual workshop is to provide real-world examples of Open Data sharing for both qualitative and quantitative data. Specifically, participants will learn: 1. Primary challenges and successes when sharing quantitative and qualitative clinical research data. 2. Platforms available for open data sharing. 3. Ways to troubleshoot data sharing and publish from open data. Workshop Agenda: 1. “Data sharing during the COVID-19 pandemic” - Speaker: Srinivas Murthy, Clinical Associate Professor, Department of Pediatrics, Faculty of Medicine, University of British Columbia. Investigator, BC Children's Hospital 2. “Our experience with Open Data for the 'Integrating a neonatal healthcare package for Malawi' project.” - Speaker: Maggie Woo Kinshella, Global Health Research Coordinator, Department of Obstetrics and Gynaecology, BC Children’s and Women’s Hospital and University of British Columbia This workshop draws on work supported by the Digital Research Alliance of Canada. Data Description: Presentation slides, Workshop Video, and Workshop Communication Srinivas Murthy: Data sharing during the COVID-19 pandemic presentation and accompanying PowerPoint slides. Maggie Woo Kinshella: Our experience with Open Data for the 'Integrating a neonatal healthcare package for Malawi' project presentation and accompanying Powerpoint slides. This workshop was developed as part of Dr. Ansermino's Data Champions Pilot Project supported by the Digital Research Alliance of Canada. NOTE for restricted files: If you are not yet a CoLab member, please complete our membership application survey to gain access to restricted files within 2 business days. Some files may remain restricted to CoLab members. These files are deemed more sensitive by the file owner and are meant to be shared on a case-by-case basis. Please contact the CoLab coordinator on this page under "collaborate with the pediatric sepsis colab."

  2. d

    Replication Data for: A Three-Year Mixed Methods Study of Undergraduates’...

    • dataone.org
    • dataverse.azure.uit.no
    • +2more
    Updated Oct 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nierenberg, Ellen (2024). Replication Data for: A Three-Year Mixed Methods Study of Undergraduates’ Information Literacy Development: Knowing, Doing, and Feeling [Dataset]. http://doi.org/10.18710/SK0R1N
    Explore at:
    Dataset updated
    Oct 9, 2024
    Dataset provided by
    DataverseNO
    Authors
    Nierenberg, Ellen
    Time period covered
    Aug 8, 2019 - Jun 10, 2022
    Description

    This data set contains the replication data and supplements for the article "Knowing, Doing, and Feeling: A three-year, mixed-methods study of undergraduates’ information literacy development." The survey data is from two samples: - cross-sectional sample (different students at the same point in time) - longitudinal sample (the same students and different points in time)Surveys were distributed via Qualtrics during the students' first and sixth semesters. Quantitative and qualitative data were collected and used to describe students' IL development over 3 years. Statistics from the quantitative data were analyzed in SPSS. The qualitative data was coded and analyzed thematically in NVivo. The qualitative, textual data is from semi-structured interviews with sixth-semester students in psychology at UiT, both focus groups and individual interviews. All data were collected as part of the contact author's PhD research on information literacy (IL) at UiT. The following files are included in this data set: 1. A README file which explains the quantitative data files. (2 file formats: .txt, .pdf)2. The consent form for participants (in Norwegian). (2 file formats: .txt, .pdf)3. Six data files with survey results from UiT psychology undergraduate students for the cross-sectional (n=209) and longitudinal (n=56) samples, in 3 formats (.dat, .csv, .sav). The data was collected in Qualtrics from fall 2019 to fall 2022. 4. Interview guide for 3 focus group interviews. File format: .txt5. Interview guides for 7 individual interviews - first round (n=4) and second round (n=3). File format: .txt 6. The 21-item IL test (Tromsø Information Literacy Test = TILT), in English and Norwegian. TILT is used for assessing students' knowledge of three aspects of IL: evaluating sources, using sources, and seeking information. The test is multiple choice, with four alternative answers for each item. This test is a "KNOW-measure," intended to measure what students know about information literacy. (2 file formats: .txt, .pdf)7. Survey questions related to interest - specifically students' interest in being or becoming information literate - in 3 parts (all in English and Norwegian): a) information and questions about the 4 phases of interest; b) interest questionnaire with 26 items in 7 subscales (Tromsø Interest Questionnaire - TRIQ); c) Survey questions about IL and interest, need, and intent. (2 file formats: .txt, .pdf)8. Information about the assignment-based measures used to measure what students do in practice when evaluating and using sources. Students were evaluated with these measures in their first and sixth semesters. (2 file formats: .txt, .pdf)9. The Norwegain Centre for Research Data's (NSD) 2019 assessment of the notification form for personal data for the PhD research project. In Norwegian. (Format: .pdf)

  3. d

    Data from: Data sets for a quantitative dye tracer test conducted at the...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Nov 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Data sets for a quantitative dye tracer test conducted at the Savoy Experimental Watershed, November 13-December 2, 2017, Savoy, Arkansas [Dataset]. https://catalog.data.gov/dataset/data-sets-for-a-quantitative-dye-tracer-test-conducted-at-the-savoy-experimental-watershed
    Explore at:
    Dataset updated
    Nov 21, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Savoy, Arkansas
    Description

    These are the data sets in machine readable files from a quantitative dye tracer test conducted at Langle Spring November 13-December 2, 2017 as part of the USGS training class, GW2227 Advanced Field Methods in Karst Terrains, held at the Savoy Experimental Watershed, Savoy Arkansas. Langle Spring is NWIS site 71948218, latitude 36.11896886, longitude -94.34548871. One pound of RhodamineWT dye was injected into a sinking stream at latitude 36.116772 longitude -94.341883 NAD83 on November 13, 2017 at 22:50. The data sets include original fluorimeter data logger files from Langle and Copperhead Springs, Laboratory Sectra-fluorometer files from standards and grab samples, and processed input and output files from the breakthrough curve analysis program Qtracer2 (Field, USEPA, 2002 EPA/600/R-02/001).

  4. Clustering of samples and variables with mixed-type data

    • plos.figshare.com
    tiff
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuela Hummel; Dominic Edelmann; Annette Kopp-Schneider (2023). Clustering of samples and variables with mixed-type data [Dataset]. http://doi.org/10.1371/journal.pone.0188274
    Explore at:
    tiffAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Manuela Hummel; Dominic Edelmann; Annette Kopp-Schneider
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of data measured on different scales is a relevant challenge. Biomedical studies often focus on high-throughput datasets of, e.g., quantitative measurements. However, the need for integration of other features possibly measured on different scales, e.g. clinical or cytogenetic factors, becomes increasingly important. The analysis results (e.g. a selection of relevant genes) are then visualized, while adding further information, like clinical factors, on top. However, a more integrative approach is desirable, where all available data are analyzed jointly, and where also in the visualization different data sources are combined in a more natural way. Here we specifically target integrative visualization and present a heatmap-style graphic display. To this end, we develop and explore methods for clustering mixed-type data, with special focus on clustering variables. Clustering of variables does not receive as much attention in the literature as does clustering of samples. We extend the variables clustering methodology by two new approaches, one based on the combination of different association measures and the other on distance correlation. With simulation studies we evaluate and compare different clustering strategies. Applying specific methods for mixed-type data proves to be comparable and in many cases beneficial as compared to standard approaches applied to corresponding quantitative or binarized data. Our two novel approaches for mixed-type variables show similar or better performance than the existing methods ClustOfVar and bias-corrected mutual information. Further, in contrast to ClustOfVar, our methods provide dissimilarity matrices, which is an advantage, especially for the purpose of visualization. Real data examples aim to give an impression of various kinds of potential applications for the integrative heatmap and other graphical displays based on dissimilarity matrices. We demonstrate that the presented integrative heatmap provides more information than common data displays about the relationship among variables and samples. The described clustering and visualization methods are implemented in our R package CluMix available from https://cran.r-project.org/web/packages/CluMix.

  5. b

    The ASK feasibility trial quantitative data - Datasets - data.bris

    • data.bris.ac.uk
    Updated Jul 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). The ASK feasibility trial quantitative data - Datasets - data.bris [Dataset]. https://data.bris.ac.uk/data/dataset/2b9vlo0wejnsh2nfoa6fka66cx
    Explore at:
    Dataset updated
    Jul 9, 2024
    Description

    In order to request access to this data please complete the data request form.* * University of Bristol staff should use this form instead. The ASK feasibility trial: a randomised controlled feasibility trial and process evaluation of a complex multicomponent intervention to improve AccesS to living-donor Kidney transplantation This trial was a two-arm, parallel group, pragmatic, individually-randomised, controlled, feasibility trial, comparing usual care with a multicomponent intervention to increase access to living-donor kidney transplantation. The trial was based at two UK hospitals: a transplanting hospital and a non-transplanting referral hospital. 62 participants were recruited. 60 participants consented to data sharing, and their trial data is available here. 2 participants did not consent to data sharing and their data is not available. This project contains: 1. The ASK feasibility trial dataset 2. The trial questionnaire 3. An example consent form 4. Trial information sheet This dataset is part of a series: ASK feasibility trial documents: https://doi.org/10.5523/bris.1u5ooi0iqmb5c26zwim8l7e8rm The ASK feasibility trial: CONSORT documents: https://doi.org/10.5523/bris.2iq6jzfkl6e1x2j1qgfbd2kkbb The ASK feasibility trial: Wellcome Open Research CONSORT checklist: https://doi.org/10.5523/bris.1m3uhbdfdrykh27iij5xck41le The ASK feasibility trial: qualitative data: https://doi.org/10.5523/bris.1qm9yblprxuj2qh3o0a2yylgg

  6. s

    MINUTE-ChIP example data

    • figshare.scilifelab.se
    txt
    Updated Jan 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carmen Navarro Luzon; Simon Elsässer (2025). MINUTE-ChIP example data [Dataset]. http://doi.org/10.17044/scilifelab.25348405.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jan 15, 2025
    Dataset provided by
    Karolinska Institutet
    Authors
    Carmen Navarro Luzon; Simon Elsässer
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    This collection contains an example MINUTE-ChIP dataset to run minute pipeline on, provided as supporting material to help users understand the results of a MINUTE-ChIP experiment from raw data to a primary analysis that yields the relevant files for downstream analysis along with summarized QC indicators. Example primary non-demultiplexed FASTQ files provided here were used to generate GSM5493452-GSM5493463 (H3K27m3) and GSM5823907-GSM5823918 (Input), deposited on GEO with the minute pipeline all together under series GSE181241. For more information about MINUTE-ChIP, you can check the publication relevant to this dataset: Kumar, Banushree, et al. "Polycomb repressive complex 2 shields naïve human pluripotent cells from trophectoderm differentiation." Nature Cell Biology 24.6 (2022): 845-857. If you want more information about the minute pipeline, there is a public biorXiv and a GitHub repository and official documentation.

  7. A quantitative data integration analysis method for cross-studies:...

    • figshare.com
    zip
    Updated May 28, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rong Zhou; Shengrong Zhou; Qiguan Xia; Tiejun Zhang; Guoqing Zhang (2022). A quantitative data integration analysis method for cross-studies: Obstructive sleep apnea as an example [Dataset]. http://doi.org/10.6084/m9.figshare.19179158.v4
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 28, 2022
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Rong Zhou; Shengrong Zhou; Qiguan Xia; Tiejun Zhang; Guoqing Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Supplementary Material for this article:Supplementary Table 1 | PRISMA 2009 checklist.Supplementary Table 2 | Search strategies.Supplementary Table 3 | Research information details and indicators dataset.Supplementary Table 4 | Specific information of indicators and studies under different profile.Supplementary Table 5 | Analysis results data.

  8. D

    Replication Data for: The Choice of Aspect in the Russian Modal Construction...

    • dataverse.no
    • search.dataone.org
    csv, pdf, tsv, txt
    Updated Sep 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Beatrice Bernasconi; Beatrice Bernasconi (2023). Replication Data for: The Choice of Aspect in the Russian Modal Construction with prixodit'sja/prijtis' [Dataset]. http://doi.org/10.18710/KR5RRK
    Explore at:
    pdf(133994), txt(731), csv(197397), txt(1563), pdf(668808), txt(1523), txt(134354), txt(3307), txt(2842), tsv(70878), pdf(992161)Available download formats
    Dataset updated
    Sep 28, 2023
    Dataset provided by
    DataverseNO
    Authors
    Beatrice Bernasconi; Beatrice Bernasconi
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    1950 - 2020
    Area covered
    Russian Federation
    Description

    This dataset includes all the data files that were used for the studies in my Master Thesis: "The Choice of Aspect in the Russian Modal Construction with prixodit'sja/prijtis'". The data files are numbered so that they are shown in the same order as they are presented in the thesis. They include the database and the code used for the statistical analysis. Their contents are described in the ReadMe files. The core of the work is a quantitative and empirical study on the choice of aspect by Russian native speakers in the modal construction prixodit’sja/prijtis’ + inf. The hypothesis is that in the modal construction prixodit’sja/prijtis’ + inf the aspect of the infinitive is not fully determined by grammatical context but, to some extent, open to construal. A preliminary analysis was carried out on data gathered from the Russian National Corpus (www.ruscorpora.ru). Four hundred and forty-seven examples with the verb prijtis' were annotated manually for several factors and a statistical test (CART) was run. Results demonstrated that no grammatical factor plays a big role in the use of one aspect rather than the other. Data for this study can be consulted in the files from 01 to 03 and include a ReadMe file, the database in .csv format and the code used for the statistical test. An experiment with native speakers was then carried out. A hundred and ten native speakers of Russian were surveyed and asked to evaluate the acceptability of the infinitive in examples with prixodit’sja/prijtis’ delat’/sdelat’ šag/vid/vybor. The survey presented seventeen examples from the Russian National Corpus that were submitted two times: the first time with the same aspect as in the original version, the second time with the other aspect. Participants had to evaluate each case by choosing among “Impossible”, “Acceptable” and “Excellent” ratings. They were also allowed to give their opinion about the difference between aspects in each example. A Logistic Regression with Mixed Effects was run on the answers. Data for this study can be consulted in the files from 04 to 010 and include a ReadMe file, the text and the answers of the questionnaire, the database in .csv, .txt and pdf formats and the code used for the statistical test. Results showed that prijtis’ often admits both aspects in the infinitive, while prixodit’sja is more restrictive and prefers imperfective. Overall, “Acceptable” and “Excellent” responses were higher than “Impossible” responses for both aspects, even when the aspect evaluated didn’t match with the original. Personal opinions showed that the choice of aspect often depends on the meaning the speaker wants to convey. Only in very few cases the grammatical context was considered to be a constraint on the choice.

  9. p

    Core Service Review - Quantitative Data - Dataset - CKAN

    • ckan0.cf.opendata.inter.prod-toronto.ca
    Updated Jul 25, 2011
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2011). Core Service Review - Quantitative Data - Dataset - CKAN [Dataset]. https://ckan0.cf.opendata.inter.prod-toronto.ca/dataset/core-service-review-quantitative-data
    Explore at:
    Dataset updated
    Jul 25, 2011
    Description

    To address Toronto's 2012 budget gap of $774 million, City Council has launched a review of all of its services and implemented a multi-year financial planning process. This data set contains the responses to the multiple- choice questions on the Core Services Review Public Consultation Feedback Form from members of the public. Approximately 13,000 responses were received (full and partial). The consultation was held between May 11 and June 17, 2011. As a public consultation, respondents chose to participate, and chose which questions to answer. This produced a self-selected sample of respondents. The majority of the responses were from City of Toronto residents. There were some responses from GTA residents. City staff reviewed the data and removed personal information and input violating city policies (for example, contravenes the City's current anti-discrimination policy or confidentiality policy). The .SAV file may be viewed with Statistics software such as SPSS or SAS.

  10. e

    Qualitative and quantitative data from contexts of use for the analysis of...

    • data.europa.eu
    • datos.cchs.csic.es
    unknown
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agencia Estatal Consejo Superior de Investigaciones Científicas, Qualitative and quantitative data from contexts of use for the analysis of six terminological units in a covid-19 corpora [Dataset]. https://data.europa.eu/data/datasets/http-hdl-handle-net-10261-266302?locale=el
    Explore at:
    unknown(641792), unknown(7065), unknown(44103)Available download formats
    Dataset authored and provided by
    Agencia Estatal Consejo Superior de Investigaciones Científicas
    License

    Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
    License information was derived automatically

    Description

    This dataset compiles examples of use of the following terms: covid-19, coronavirus, confinamiento, SARS-CoV-2, pandemia and virus. This are selected in a double quantitative and qualitative methodology from the linguistic corpora in Spanish of scientific dissemination texts from The Conversation.

  11. d

    A quantitative analysis of non-coral communities at sites along a water...

    • catalog.data.gov
    • s.cnmilf.com
    • +1more
    Updated Nov 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact) (2025). A quantitative analysis of non-coral communities at sites along a water quality gradient in the bay off of Aua, American Samoa: taxa species and counts of macroinvertebrates, benthic microalgae, and benthic foraminifera from samples collected between 2022-09-11 to 2022-09-26 (NCEI Accession 0284083) [Dataset]. https://catalog.data.gov/dataset/a-quantitative-analysis-of-non-coral-communities-at-sites-along-a-water-quality-gradient-in-the
    Explore at:
    Dataset updated
    Nov 1, 2025
    Dataset provided by
    (Point of Contact)
    Area covered
    Aua
    Description

    This data package includes a quantitative analysis of non-coral communities at sites along a water quality gradient off of Aua, American Samoa in 2022. These datasets were funded by the NOAA Coral Reef Conservation Program (CRCP) Project Number 31303 to study effects of land-based sources of pollution (LBSP) in Aua, American Samoa. In September 2022 Ecosystem Sciences Division (ESD) scientists of the Pacific Islands Fisheries Science Center (PIFSC) flew into American Samoa to survey 18 sites along a water quality gradient off of Aua. Three datasets are provided of the taxa species and counts of benthic foraminifera, benthic microalgae, and macroinvertebrates from samples collected between 9-28 September 2022. Samples were preserved in the field, and brought back to the NOAA Inouye Regional Center (IRC) and analyzed via microscopy. To analyze benthic foraminifera abundance, sediment samples were collected using a small sediment corer (60 ml syringe with the tip removed and a stopper placed there instead). Only the top 3 cm were retained. Under the microscope, benthic foraminifera were picked out of the sediment and identified. To analyze benthic microalgae, microscope slides were deployed on the seafloor at each site for 2 to 3 weeks. The benthic microalgae that settled was fixed and preserved in Lugol's solution and analyzed. Microalgae included diatoms (pennate and centric), dinoflagellates, chlorophyta, and cyanobacteria. To analyze macroinvertebrates: plastic scouring pads were deployed and attached to the substratum with zip-ties for 2 to 3 weeks. The scouring pads were removed after the settlement period, all plastic zip-ties and additional waste were removed from the reef. Macroinvertebrates that settled on the scouring pad were placed into sampling jars and fixed and preserved with 4% formalin, and subsequently analyzed under the microscope. These quantitative non-coral community surveys were one of several surveys conducted at the same sites across Aua reef in September 2022. Other surveys described and archived separately include surveys of water quality, CTD casts, coral demography, benthic imagery/benthic cover, and coral demography. These can be accessed under the 'Related Items' section of the InPort metadata record.

  12. Fama–French Factors and Portfolios

    • kaggle.com
    zip
    Updated Oct 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nikita Manaenkov (2025). Fama–French Factors and Portfolios [Dataset]. https://www.kaggle.com/datasets/nikitamanaenkov/famafrench-factors-and-portfolios
    Explore at:
    zip(177539895 bytes)Available download formats
    Dataset updated
    Oct 30, 2025
    Authors
    Nikita Manaenkov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset provides foundational factor and portfolio return data used in empirical finance and asset pricing research. It contains: - Fama–French 3-Factor and 5-Factor models - Size (ME), Book-to-Market (B/M), Operating Profitability (OP), and Investment (Inv) portfolios - Bivariate portfolios (e.g., 2x3 Size-B/M sorts) - Industry portfolio returns All data originate from the Kenneth R. French Data Library and are based on CRSP and Compustat databases. Data are value-weighted and expressed in percentages.

    Some files in this dataset contain header comments describing data sources and methodology (as shown below):

    This file was created using the 202508 CRSP database.
    The 1-month TBill rate data until 202405 are from Ibbotson Associates. 
    Starting from 202406, the 1-month TBill rate is from ICE BofA US 1-Month Treasury Bill Index.
    

    To correctly read such files in Python (pandas), use the comment parameter — it automatically ignores all lines starting with a specific symbol (e.g., none here, so you can skip manually):

    Example 1 — Automatically detect header rows:

    import pandas as pd
    
    # Detect the first numeric line to find where data starts
    file_path = "F-F_Research_Data_5_Factors_2x3.csv"
    
    with open(file_path) as f:
      lines = f.readlines()
    
    # Find where the header line (column names) appears
    for i, line in enumerate(lines):
      if "Mkt-RF" in line:
        skip_rows = i
        break
    
    df = pd.read_csv(file_path, skiprows=skip_rows, sep=r"\s+")
    print(df.head())
    

    Example 2 — Skip a known number of comment lines manually:

    df = pd.read_csv("F-F_Research_Data_5_Factors_2x3.csv", skiprows=3, sep=r"\s+")
    

    Example 3 — If comments are prefixed (e.g., with #):

    df = pd.read_csv("F-F_Research_Data_5_Factors_2x3.csv", comment="#", sep=",")
    

    File Structure Example

    ColumnDescription
    Mkt-RFMarket excess return
    SMBSmall minus Big (size factor)
    HMLHigh minus Low (book-to-market factor)
    RMWRobust minus Weak (profitability factor)
    CMAConservative minus Aggressive (investment factor)
    RFRisk-free rate (1-month Treasury Bill)
  13. f

    Data from: Statistical Design of Quantitative Mass Spectrometry-Based...

    • acs.figshare.com
    zip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ann L. Oberg; Olga Vitek (2023). Statistical Design of Quantitative Mass Spectrometry-Based Proteomic Experiments [Dataset]. http://doi.org/10.1021/pr8010099.s001
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    ACS Publications
    Authors
    Ann L. Oberg; Olga Vitek
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    We review the fundamental principles of statistical experimental design, and their application to quantitative mass spectrometry-based proteomics. We focus on class comparison using Analysis of Variance (ANOVA), and discuss how randomization, replication and blocking help avoid systematic biases due to the experimental procedure, and help optimize our ability to detect true quantitative changes between groups. We also discuss the issues of pooling multiple biological specimens for a single mass analysis, and calculation of the number of replicates in a future study. When applicable, we emphasize the parallels between designing quantitative proteomic experiments and experiments with gene expression microarrays, and give examples from that area of research. We illustrate the discussion using theoretical considerations, and using real-data examples of profiling of disease.

  14. Z

    Data from: Automated Qualitative and Quantitative Analysis of Complex...

    • nde-dev.biothings.io
    • data.niaid.nih.gov
    • +1more
    Updated Feb 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Richard R. Laing (2025). Automated Qualitative and Quantitative Analysis of Complex Forensic Drug Samples using 1H NMR [Dataset]. https://nde-dev.biothings.io/resources?id=zenodo_5933787
    Explore at:
    Dataset updated
    Feb 11, 2025
    Dataset provided by
    M. Mehr, S. Hessam
    Richard R. Laing
    Aaron W. Tang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset to accompany the manuscript "Automated Qualitative and Quantitative Analysis of Complex Forensic Drug Samples using 1H NMR"

  15. TIDieR items and examples of data extracted.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Titilayo Tatiana Agbadjé; Paula Riganti; Évèhouénou Lionel Adisso; Rhéda Adekpedjou; Alexandrine Boucher; Andressa Teoli Nunciaroni; Juan Victor Ariel Franco; Maria Victoria Ruiz Yanzi; France Légaré (2023). TIDieR items and examples of data extracted. [Dataset]. http://doi.org/10.1371/journal.pone.0265401.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 15, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Titilayo Tatiana Agbadjé; Paula Riganti; Évèhouénou Lionel Adisso; Rhéda Adekpedjou; Alexandrine Boucher; Andressa Teoli Nunciaroni; Juan Victor Ariel Franco; Maria Victoria Ruiz Yanzi; France Légaré
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    TIDieR items and examples of data extracted.

  16. o

    Historical quantitative benthos grab samples from the Southern Baltic Sea -...

    • obis.org
    • erddap.eurobis.org
    • +3more
    zip
    Updated Sep 17, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vlaams Instituut voor de Zee (2025). Historical quantitative benthos grab samples from the Southern Baltic Sea - German data [Dataset]. https://obis.org/dataset/cad31b12-0f59-4abf-9109-cb09ea40c297
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 17, 2025
    Dataset authored and provided by
    Vlaams Instituut voor de Zee
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    1911 - 1970
    Area covered
    Baltic Sea
    Description

    Historical quantitative benthos grab samples from the Southern Baltic Sea, collected by German researchers. The dataset was digitised at the Christian-Albrechts-University Kiel; Leibniz Institute of Marine Sciences; Marine Ecology Division; Benthos Ecology section.

  17. d

    Data from: tableone: An open source Python package for producing summary...

    • datadryad.org
    • search.dataone.org
    • +1more
    zip
    Updated Apr 23, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tom J. Pollard; Alistair E. W. Johnson; Jesse D. Raffa; Roger G. Mark (2019). tableone: An open source Python package for producing summary statistics for research papers [Dataset]. http://doi.org/10.5061/dryad.26c4s35
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 23, 2019
    Dataset provided by
    Dryad
    Authors
    Tom J. Pollard; Alistair E. W. Johnson; Jesse D. Raffa; Roger G. Mark
    Time period covered
    Apr 19, 2018
    Description

    Objectives: In quantitative research, understanding basic parameters of the study population is key for interpretation of the results. As a result, it is typical for the first table (“Table 1”) of a research paper to include summary statistics for the study data. Our objectives are 2-fold. First, we seek to provide a simple, reproducible method for providing summary statistics for research papers in the Python programming language. Second, we seek to use the package to improve the quality of summary statistics reported in research papers.

    Materials and Methods: The tableone package is developed following good practice guidelines for scientific computing and all code is made available under a permissive MIT License. A testing framework runs on a continuous integration server, helping to maintain code stability. Issues are tracked openly and public contributions are encouraged.

    Results: The tableone software package automatically compiles summary statistics into publishable formats such...

  18. Z

    Data from: Supplementary Materials: A primer on gathering and analysing...

    • data.niaid.nih.gov
    • researchportal.scu.edu.au
    • +1more
    Updated Jan 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kieran Balloo; Naomi E. Winstone (2021). Supplementary Materials: A primer on gathering and analysing multi-level quantitative evidence for differential student outcomes in higher education [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4115263
    Explore at:
    Dataset updated
    Jan 27, 2021
    Dataset provided by
    University of Surrey
    Authors
    Kieran Balloo; Naomi E. Winstone
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Example data sets, syntax files and macros for the tutorials in: Balloo, K., & Winstone, N. E. (2021). A primer on gathering and analysing multi-level quantitative evidence for differential student outcomes in higher education. Frontline Learning Research. https://doi.org/10.14786/flr.v9i2.675

    The data for all examples are fictional, and have only been designed to simulate the possible behaviour of institutional data for the purposes of demonstrating the analytical approaches in the primer. No inferences or conclusions should be drawn from the findings of these examples, because the results are not real.

    We anticipate that readers can use the example data sets as templates and substitute in their own data.

  19. Quantitative and qualitative data relating to the inhibition of a...

    • ckan.publishing.service.gov.uk
    Updated Jun 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2024). Quantitative and qualitative data relating to the inhibition of a phosphatase reaction by microcystin on a paper-based analytical device (PAD) - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/quantitative-and-qualitative-data-relating-to-the-inhibition-of-a-phosphatase-reaction-by-micro
    Explore at:
    Dataset updated
    Jun 29, 2024
    Dataset provided by
    CKANhttps://ckan.org/
    Description

    This dataset contains information about the development of a paper analytical device for the detection of the microcystin toxin. Water samples were collected weekly from Rostherne Mere and Tatton Park Lake, Cheshire between 20/07/2022 and 12/10/2022. Samples were analysed for the presence of microcystin-producing genes and released microcystin. A paper analytical device incorporating a protein phosphatase inhibition assay was also used to monitor microcystin levels. Preliminary user evaluation of the paper analytical devices and associated mobile photo applications is also provided. The work was supported by the Natural Environment Research Council (Grant NE/X011607/1). Full details about this dataset can be found at https://doi.org/10.5285/5fe25b28-f10b-467b-8812-0b1ee6fd7491

  20. e

    GAPs Data Repository on Return: Guideline, Data Samples and Codebook

    • data.europa.eu
    • data.niaid.nih.gov
    unknown
    Updated Feb 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RedCAP (2025). GAPs Data Repository on Return: Guideline, Data Samples and Codebook [Dataset]. https://data.europa.eu/data/datasets/oai-zenodo-org-14862490?locale=pt
    Explore at:
    unknown(3802528)Available download formats
    Dataset updated
    Feb 12, 2025
    Dataset authored and provided by
    RedCAP
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The GAPs Data Repository provides a comprehensive overview of available qualitative and quantitative data on national return regimes, now accessible through an advanced web interface at https://data.returnmigration.eu/. This updated guideline outlines the complete process, starting from the initial data collection for the return migration data repository to the development of a comprehensive web-based platform. Through iterative development, participatory approaches, and rigorous quality checks, we have ensured a systematic representation of return migration data at both national and comparative levels. The Repository organizes data into five main categories, covering diverse aspects and offering a holistic view of return regimes: country profiles, legislation, infrastructure, international cooperation, and descriptive statistics. These categories, further divided into subcategories, are based on insights from a literature review, existing datasets, and empirical data collection from 14 countries. The selection of categories prioritizes relevance for understanding return and readmission policies and practices, data accessibility, reliability, clarity, and comparability. Raw data is meticulously collected by the national experts. The transition to a web-based interface builds upon the Repository’s original structure, which was initially developed using REDCap (Research Electronic Data Capture). It is a secure web application for building and managing online surveys and databases.The REDCAP ensures systematic data entries and store them on Uppsala University’s servers while significantly improving accessibility and usability as well as data security. It also enables users to export any or all data from the Project when granted full data export privileges. Data can be exported in various ways and formats, including Microsoft Excel, SAS, Stata, R, or SPSS for analysis. At this stage, the Data Repository design team also converted tailored records of available data into public reports accessible to anyone with a unique URL, without the need to log in to REDCap or obtain permission to access the GAPs Project Data Repository. Public reports can be used to share information with stakeholders or external partners without granting them access to the Project or requiring them to set up a personal account. Currently, all public report links inserted in this report are also available on the Repository’s webpage, allowing users to export original data. This report also includes a detailed codebook to help users understand the structure, variables, and methodologies used in data collection and organization. This addition ensures transparency and provides a comprehensive framework for researchers and practitioners to effectively interpret the data. The GAPs Data Repository is committed to providing accessible, well-organized, and reliable data by moving to a centralized web platform and incorporating advanced visuals. This Repository aims to contribute inputs for research, policy analysis, and evidence-based decision-making in the return and readmission field. Explore the GAPs Data Repository at https://data.returnmigration.eu/.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Srinvivas Murthy; Maggie Woo Kinshella; Jessica Trawin; Teresa Johnson; Niranjan Kissoon; Matthew Wiens; Gina Ogilvie; Gurm Dhugga; J Mark Ansermino (2023). Open Data Training Workshop: Case Studies in Open Data for Qualitative and Quantitative Clinical Research [Dataset]. http://doi.org/10.5683/SP3/BNNAE7

Open Data Training Workshop: Case Studies in Open Data for Qualitative and Quantitative Clinical Research

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 18, 2023
Dataset provided by
Borealis
Authors
Srinvivas Murthy; Maggie Woo Kinshella; Jessica Trawin; Teresa Johnson; Niranjan Kissoon; Matthew Wiens; Gina Ogilvie; Gurm Dhugga; J Mark Ansermino
License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Dataset funded by
Digital Research Alliance of Canada
Description

Objective(s): Momentum for open access to research is growing. Funding agencies and publishers are increasingly requiring researchers make their data and research outputs open and publicly available. However, clinical researchers struggle to find real-world examples of Open Data sharing. The aim of this 1 hr virtual workshop is to provide real-world examples of Open Data sharing for both qualitative and quantitative data. Specifically, participants will learn: 1. Primary challenges and successes when sharing quantitative and qualitative clinical research data. 2. Platforms available for open data sharing. 3. Ways to troubleshoot data sharing and publish from open data. Workshop Agenda: 1. “Data sharing during the COVID-19 pandemic” - Speaker: Srinivas Murthy, Clinical Associate Professor, Department of Pediatrics, Faculty of Medicine, University of British Columbia. Investigator, BC Children's Hospital 2. “Our experience with Open Data for the 'Integrating a neonatal healthcare package for Malawi' project.” - Speaker: Maggie Woo Kinshella, Global Health Research Coordinator, Department of Obstetrics and Gynaecology, BC Children’s and Women’s Hospital and University of British Columbia This workshop draws on work supported by the Digital Research Alliance of Canada. Data Description: Presentation slides, Workshop Video, and Workshop Communication Srinivas Murthy: Data sharing during the COVID-19 pandemic presentation and accompanying PowerPoint slides. Maggie Woo Kinshella: Our experience with Open Data for the 'Integrating a neonatal healthcare package for Malawi' project presentation and accompanying Powerpoint slides. This workshop was developed as part of Dr. Ansermino's Data Champions Pilot Project supported by the Digital Research Alliance of Canada. NOTE for restricted files: If you are not yet a CoLab member, please complete our membership application survey to gain access to restricted files within 2 business days. Some files may remain restricted to CoLab members. These files are deemed more sensitive by the file owner and are meant to be shared on a case-by-case basis. Please contact the CoLab coordinator on this page under "collaborate with the pediatric sepsis colab."

Search
Clear search
Close search
Google apps
Main menu