100+ datasets found
  1. Using Descriptive Statistics to Analyse Data in R

    • kaggle.com
    zip
    Updated May 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Enrico68 (2024). Using Descriptive Statistics to Analyse Data in R [Dataset]. https://www.kaggle.com/datasets/enrico68/using-descriptive-statistics-to-analyse-data-in-r
    Explore at:
    zip(105561 bytes)Available download formats
    Dataset updated
    May 9, 2024
    Authors
    Enrico68
    Description

    Load and view a real-world dataset in RStudio

    • Calculate “Measure of Frequency” metrics

    • Calculate “Measure of Central Tendency” metrics

    • Calculate “Measure of Dispersion” metrics

    • Use R’s in-built functions for additional data quality metrics

    • Create a custom R function to calculate descriptive statistics on any given dataset

  2. R script for summary statistics and structural equation modelling

    • figshare.com
    txt
    Updated Feb 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eleanor Durrant; Marion Pfeifer (2024). R script for summary statistics and structural equation modelling [Dataset]. http://doi.org/10.6084/m9.figshare.25226258.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Feb 15, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Eleanor Durrant; Marion Pfeifer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    R script used with accompanying data frame 'plot_character' that is within the project to calculate summary statistics and structural equation modelling.

  3. E

    Data from: STAD-R Descriptive statistics for experimental designs

    • data.moa.gov.et
    html
    Updated Jan 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CIMMYT Ethiopia (2025). STAD-R Descriptive statistics for experimental designs [Dataset]. https://data.moa.gov.et/dataset/hdl-11529-10853
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Jan 20, 2025
    Dataset provided by
    CIMMYT Ethiopia
    Description

    STAD-R is a set of R programs that performs descriptive statistics, in order to make boxplots and histograms. STAD-R was designed because is necessary before than the thing, check if the dataset have the same number of repetitions, blocks, genotypes, environments, if we have missing values, where and how many, review the distributions and outliers, because is important to be sure that the dataset is complete and have the correct structure for do and other kind of analysis.

  4. b

    Guidelines for Computing Summary Statistics for Data-Sets Containing...

    • datahub.bvcentre.ca
    Updated Jun 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Guidelines for Computing Summary Statistics for Data-Sets Containing Non-Detects - Dataset - BVRC DataHub [Dataset]. https://datahub.bvcentre.ca/dataset/guidelines-for-computing-summary-statistics-for-data-sets-containing-non-detects
    Explore at:
    Dataset updated
    Jun 3, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    INTRODUCTION As part of its responsibilities, the BC Ministry of Environment monitors water quality in the province’s streams, rivers, and lakes. Often, it is necessary to compile statistics involving concentrations of contaminants or other compounds. Quite often the instruments used cannot measure concentrations below certain values. These observations are called non-detects or less thans. However, non-detects pose a difficulty when it is necessary to compute statistical measurements such as the mean, the median, and the standard deviation for a data set. The way non-detects are handled can affect the quality of any statistics generated. Non-detects, or censored data are found in many fields such as medicine, engineering, biology, and environmetrics. In such fields, it is often the case that the measurements of interest are below some threshold. Dealing with non-detects is a significant issue and statistical tools using survival or reliability methods have been developed. Basically, there are three approaches for treating data containing censored values: 1. substitution, which gives poor results and therefore, is not recommended in the literature; 2. maximum likelihood estimation, which requires an assumption of some distributional form; and 3. and nonparametric methods which assess the shape of the data based on observed percentiles rather than a strict distributional form. This document provides guidance on how to record censor data, and on when and how to use certain analysis methods when the percentage of censored observations is less than 50%. The methods presented in this document are:1. substitution; 2. Kaplan-Meier, as part of nonparametric methods; 3. lognormal model based on maximum likelihood estimation; 4. and robust regression on order statistics, which is a semiparametric method. Statistical software suitable for survival or reliability analysis is available for dealing with censored data. This software has been widely used in medical and engineering environments. In this document, methods are illustrated with both R and JMP software packages, when possible. JMP often requires some intermediate steps to obtain summary statistics with most of the methods described in this document. R, with the NADA package is usually straightforward. The package NADA was developed specifically for computing statistics with non-detects in environmental data based on Helsel (2005b). The data used to illustrate the methods described for computing summary statistics for non-detects are either simulated or based on information acquired from the B.C. Ministry of Environment. This document is strongly based on the book Nondetects And Data Analysis written by Dennis R. Helsel in 2005 (Helsel, 2005b).

  5. f

    Data_Sheet_3_“R” U ready?: a case study using R to analyze changes in gene...

    • frontiersin.figshare.com
    • figshare.com
    docx
    Updated Mar 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amy E. Pomeroy; Andrea Bixler; Stefanie H. Chen; Jennifer E. Kerr; Todd D. Levine; Elizabeth F. Ryder (2024). Data_Sheet_3_“R” U ready?: a case study using R to analyze changes in gene expression during evolution.docx [Dataset]. http://doi.org/10.3389/feduc.2024.1379910.s003
    Explore at:
    docxAvailable download formats
    Dataset updated
    Mar 22, 2024
    Dataset provided by
    Frontiers
    Authors
    Amy E. Pomeroy; Andrea Bixler; Stefanie H. Chen; Jennifer E. Kerr; Todd D. Levine; Elizabeth F. Ryder
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    As high-throughput methods become more common, training undergraduates to analyze data must include having them generate informative summaries of large datasets. This flexible case study provides an opportunity for undergraduate students to become familiar with the capabilities of R programming in the context of high-throughput evolutionary data collected using macroarrays. The story line introduces a recent graduate hired at a biotech firm and tasked with analysis and visualization of changes in gene expression from 20,000 generations of the Lenski Lab’s Long-Term Evolution Experiment (LTEE). Our main character is not familiar with R and is guided by a coworker to learn about this platform. Initially this involves a step-by-step analysis of the small Iris dataset built into R which includes sepal and petal length of three species of irises. Practice calculating summary statistics and correlations, and making histograms and scatter plots, prepares the protagonist to perform similar analyses with the LTEE dataset. In the LTEE module, students analyze gene expression data from the long-term evolutionary experiments, developing their skills in manipulating and interpreting large scientific datasets through visualizations and statistical analysis. Prerequisite knowledge is basic statistics, the Central Dogma, and basic evolutionary principles. The Iris module provides hands-on experience using R programming to explore and visualize a simple dataset; it can be used independently as an introduction to R for biological data or skipped if students already have some experience with R. Both modules emphasize understanding the utility of R, rather than creation of original code. Pilot testing showed the case study was well-received by students and faculty, who described it as a clear introduction to R and appreciated the value of R for visualizing and analyzing large datasets.

  6. f

    Summary statistics of variables used in analyses.

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    • +1more
    Updated Jan 10, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hipp, John R.; Wickes, Rebecca; Li, Tiebei; Corcoran, Jonathan (2014). Summary statistics of variables used in analyses. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001207119
    Explore at:
    Dataset updated
    Jan 10, 2014
    Authors
    Hipp, John R.; Wickes, Rebecca; Li, Tiebei; Corcoran, Jonathan
    Description

    Note: Sample size is 4,351 respondents in 146 neighborhoods.

  7. Summary statistics.

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Camelia M. Kuhnen; Gregory R. Samanez-Larkin; Brian Knutson (2023). Summary statistics. [Dataset]. http://doi.org/10.1371/journal.pone.0054632.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Camelia M. Kuhnen; Gregory R. Samanez-Larkin; Brian Knutson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sample summary statistics for subjects’ real life and experimental financial outcomes, demographic characteristics, and measures of cognitive and affect measures.

  8. u

    Summary statistics for multivariate GWAS extension of cognitive and...

    • rdr.ucl.ac.uk
    application/gzip
    Updated Jun 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrea Allegrini; Margherita Malanchini (2024). Summary statistics for multivariate GWAS extension of cognitive and non-cognitive skills [Dataset]. http://doi.org/10.5522/04/26014027.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jun 25, 2024
    Dataset provided by
    University College London
    Authors
    Andrea Allegrini; Margherita Malanchini
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    GWAS summary statistics for multivariate GWAS model extension of cognitive and noncognitive skills. From: 'Malanchini, M., Allegrini, A. G., Nivard, M. G., Biroli, P., Rimfeld, K., Cheesman, R., ... & Plomin, R. (2023). Genetic contributions of noncognitive skills to academic development. Research Square.' Columns: SNP = rsID, CHR = chromosome, BP = position, MAF = minor allele frequency (1000 Genomes Phase 3), A1 = effect allele, A2 = other allele, BETA = estimate of the SNP effect, SE = standard error of BETA, Z = Z-statistic, PVAL = p-value.

  9. Automated_Descriptive_Statistics_Pipeline R Studio

    • kaggle.com
    zip
    Updated Nov 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr. Nagendra (2025). Automated_Descriptive_Statistics_Pipeline R Studio [Dataset]. https://www.kaggle.com/datasets/mannekuntanagendra/automated-descriptive-statistics-pipeline-r-studio
    Explore at:
    zip(21548 bytes)Available download formats
    Dataset updated
    Nov 29, 2025
    Authors
    Dr. Nagendra
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    • Automated parametric analysis workflow built using R Studio.
    • Demonstrates core statistical analysis methods on numerical datasets.
    • Includes step-by-step R scripts for performing t-tests, ANOVA, and summary statistics.
    • Provides visual outputs such as boxplots and distribution plots for better interpretation.
    • Designed for students, researchers, and data analysts learning statistical automation in R.
    • Useful for understanding reproducible research workflows in data analysis.
    • Dataset helps in teaching how to automate statistical pipelines using R programming.

  10. Summary descriptive statistics of TIMSS dataset.

    • plos.figshare.com
    xls
    Updated Feb 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Fries; Sandra Oberleiter; Jakob Pietschnig (2024). Summary descriptive statistics of TIMSS dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0297033.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 2, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jonathan Fries; Sandra Oberleiter; Jakob Pietschnig
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Regression ranks among the most popular statistical analysis methods across many research areas, including psychology. Typically, regression coefficients are displayed in tables. While this mode of presentation is information-dense, extensive tables can be cumbersome to read and difficult to interpret. Here, we introduce three novel visualizations for reporting regression results. Our methods allow researchers to arrange large numbers of regression models in a single plot. Using regression results from real-world as well as simulated data, we demonstrate the transformations which are necessary to produce the required data structure and how to subsequently plot the results. The proposed methods provide visually appealing ways to report regression results efficiently and intuitively. Potential applications range from visual screening in the model selection stage to formal reporting in research papers. The procedure is fully reproducible using the provided code and can be executed via free-of-charge, open-source software routines in R.

  11. SYD ALL climate data statistics summary

    • researchdata.edu.au
    Updated Mar 13, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bioregional Assessment Program (2019). SYD ALL climate data statistics summary [Dataset]. https://researchdata.edu.au/syd-all-climate-statistics-summary/2989432
    Explore at:
    Dataset updated
    Mar 13, 2019
    Dataset provided by
    Data.govhttps://data.gov/
    Authors
    Bioregional Assessment Program
    License

    Attribution 2.5 (CC BY 2.5)https://creativecommons.org/licenses/by/2.5/
    License information was derived automatically

    Description

    Abstract \r

    \r The dataset was derived by the Bioregional Assessment Programme from multiple source datasets. The source datasets are identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.\r \r \r \r There are 4 csv files here:\r \r BAWAP_P_annual_BA_SYB_GLO.csv\r \r Desc: Time series mean annual BAWAP rainfall from 1900 - 2012.\r \r Source data: annual BILO rainfall on \\wron\Project\BA\BA_N_Sydney\Working\li036_Lingtao_LI\Grids\BILO_Rain_Ann\\r \r \r \r P_PET_monthly_BA_SYB_GLO.csv\r \r long term average BAWAP rainfall and Penman PET from 198101 - 201212 for each month\r \r \r \r Climatology_Trend_BA_SYB_GLO.csv\r \r Values calculated over the years 1981 - 2012 (inclusive), for 17 time periods (i.e., annual, 4 seasons and 12 months) for the following 8 meteorological variables: (i) BAWAP_P; (ii) Penman ETp; (iii) Tavg; (iv) Tmax; (v) Tmin; (vi) VPD; (vii) Rn; and (viii) Wind speed. For each of the 17 time periods for each of the 8 meteorological variables have calculated the: (a) average; (b) maximum; (c) minimum; (d) average plus standard deviation (stddev); (e) average minus stddev; (f) stddev; and (g) trend\r \r \r \r Risbey_Remote_Rainfall_Drivers_Corr_Coeffs_BA_NSB_GLO.csv\r \r Correlation coefficients (-1 to 1) between rainfall and 4 remote rainfall drivers between 1957-2006 for the four seasons. The data and methodology are described in Risbey et al. (2009). All data used in this analysis came directly from James Risbey, CMAR, Hobart. As described in the Risbey et al. (2009) paper, the rainfall was from 0.05 degree gridded data described in Jeffrey et al. (2001 - known as the SILO datasets); sea surface temperature was from the Hadley Centre Sea Ice and Sea Surface Temperature dataset (HadISST) on a 1 degree grid. BLK=Blocking; DMI=Dipole Mode Index; SAM=Southern Annular Mode; SOI=Southern Oscillation Index; DJF=December, January, February; MAM=March, April, May; JJA=June, July, August; SON=September, October, November. The analysis is a summary of Fig. 15 of Risbey et al. (2009).\r \r

    Dataset History \r

    \r Dataset was created from various BILO source data, including Monthly BILO rainfall, Tmax, Tmin, VPD, etc, and other source data including monthly Penman PET (calculated by Randall Donohue), Correlation coefficient data from James Risbey\r \r

    Dataset Citation \r

    \r Bioregional Assessment Programme (XXXX) SYD ALL climate data statistics summary. Bioregional Assessment Derived Dataset. Viewed 13 March 2019, http://data.bioregionalassessments.gov.au/dataset/b0a6ccf1-395d-430e-adf1-5068f8371dea.\r \r

    Dataset Ancestors \r

    \r * Derived From BILO Gridded Climate Data: Daily Climate Data for each year from 1900 to 2012\r \r

  12. f

    Summary statistics at each timepoint of data collection for participants...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated May 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bell, Michelle L.; Fussell, Elizabeth; Lowe, Sarah R.; Burrows, Kate; Fong, Kelvin C. (2023). Summary statistics at each timepoint of data collection for participants with complete data (n = 229). [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001010050
    Explore at:
    Dataset updated
    May 11, 2023
    Authors
    Bell, Michelle L.; Fussell, Elizabeth; Lowe, Sarah R.; Burrows, Kate; Fong, Kelvin C.
    Description

    Summary statistics at each timepoint of data collection for participants with complete data (n = 229).

  13. Summary statistics for all the variables for each category.

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francisco Javier Moreno-Martínez; Pedro R. Montoro (2023). Summary statistics for all the variables for each category. [Dataset]. http://doi.org/10.1371/journal.pone.0037527.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Francisco Javier Moreno-Martínez; Pedro R. Montoro
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Note: AoA = Age of acquisision; Fam = Familiarity, LF = Lexical frequency (natural logarithm); Man = Manipulability; VC = Visual complexity; % NA = Percentage of name agreement.

  14. f

    Summary statistics of variables used in analyses (N = 81,674).

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Feb 10, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lakon, Cynthia M.; Butts, Carter T.; Wang, Cheng; Jose, Rupa; Hipp, John R. (2021). Summary statistics of variables used in analyses (N = 81,674). [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000925623
    Explore at:
    Dataset updated
    Feb 10, 2021
    Authors
    Lakon, Cynthia M.; Butts, Carter T.; Wang, Cheng; Jose, Rupa; Hipp, John R.
    Description

    Summary statistics of variables used in analyses (N = 81,674).

  15. w

    Dataset of books called An introduction to data analysis in R : hands-on...

    • workwithdata.com
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of books called An introduction to data analysis in R : hands-on coding, data mining, visualization and statistics from scratch [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=An+introduction+to+data+analysis+in+R+%3A+hands-on+coding%2C+data+mining%2C+visualization+and+statistics+from+scratch
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about books. It has 1 row and is filtered where the book is An introduction to data analysis in R : hands-on coding, data mining, visualization and statistics from scratch. It features 7 columns including author, publication date, language, and book publisher.

  16. u

    Data from: EWAS of lung function in Latinos with asthma - Summary Statistics...

    • portalciencia.ull.es
    • data.niaid.nih.gov
    Updated 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Herrera-Luis, Esther; Li, Annie; Mak, Angel C. Y.; Perez-Garcia, Javier; Elhawary, Jennifer R.; Oh, Sam S.; Hu, Donglei; Eng, Celeste; Keys, Kevin L.; Huntsman, Scott; Beckman, Kenneth B.; Borrell, Luisa N.; Rodriguez-Santana, Jose; Burchard, Esteban G.; Pino-Yanes, Maria; Herrera-Luis, Esther; Li, Annie; Mak, Angel C. Y.; Perez-Garcia, Javier; Elhawary, Jennifer R.; Oh, Sam S.; Hu, Donglei; Eng, Celeste; Keys, Kevin L.; Huntsman, Scott; Beckman, Kenneth B.; Borrell, Luisa N.; Rodriguez-Santana, Jose; Burchard, Esteban G.; Pino-Yanes, Maria (2021). EWAS of lung function in Latinos with asthma - Summary Statistics [Dataset]. https://portalciencia.ull.es/documentos/668fc446b9e7c03b01bd86a2
    Explore at:
    Dataset updated
    2021
    Authors
    Herrera-Luis, Esther; Li, Annie; Mak, Angel C. Y.; Perez-Garcia, Javier; Elhawary, Jennifer R.; Oh, Sam S.; Hu, Donglei; Eng, Celeste; Keys, Kevin L.; Huntsman, Scott; Beckman, Kenneth B.; Borrell, Luisa N.; Rodriguez-Santana, Jose; Burchard, Esteban G.; Pino-Yanes, Maria; Herrera-Luis, Esther; Li, Annie; Mak, Angel C. Y.; Perez-Garcia, Javier; Elhawary, Jennifer R.; Oh, Sam S.; Hu, Donglei; Eng, Celeste; Keys, Kevin L.; Huntsman, Scott; Beckman, Kenneth B.; Borrell, Luisa N.; Rodriguez-Santana, Jose; Burchard, Esteban G.; Pino-Yanes, Maria
    Description

    Summary statistics generated for the manuscript entitled "Epigenome-wide association study of lung function in Latino children and youth with asthma" Our aim was to identify DNA methylation signals associated with lung function in Latino youth with asthma and validate previous epigenetic signals from non-Latino populations. For that, we performed multiple epigenome-wide association studies (EWAS) of lung function measurements analyzing whole blood from 250 Puerto Rican (PR) and 148 Mexican American (MEX) youth with asthma from the Genes-Environment and Admixture in Latino Americans (GALA II) study. The following measurements were evaluated Pre- and post- albuterol administration: Forced expiratory volume in one second (FEV1.Meas), forced vital capacity (FVC.Meas) and their ratio (FEV1.FVC.Meas). DNA methylation was profiled with the Infinium EPIC BeadChip or the Infinium HumanMethylation450 BeadChip array (Illumina, San Diego, CA, USA). The association of methylation beta-values and raw PFT values (in liters) was tested by robust linear regressions with correction for age, sex, height, the first three genotype principal components (PCs), in utero maternal smoking exposure, the first six ReFACTor components, and batch, when appropriate, via limma R package. The results for individuals of the same ethnic subgroup were meta-analyzed using fixed- or random-effects models, based on Cochran's Q p-value. Version 1 is deprecated. The EWAS result files (*.txt) contains: RSID: CpG name. STUDY: Number of sets of individuals included in the meta-analysis. BETA_meta: Coefficient of the regression. SEBETA_meta: Standard error of the coefficient of the regression. PVALUE_meta: P-value for the association. PVALUE_Q: Cochran's Q p-value. Model: Fixed-effect (FE) or Random-effects (RE2) model. PVALUE_meta_adj: False discovery rate (Benjamini & Hochberg method).

  17. f

    Summary statistics of temporal trend analysis (coefficient and R square) for...

    • datasetcatalog.nlm.nih.gov
    Updated Oct 2, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hu, Wenbiao; Akter, Rokeya; Tong, Shilu; Naish, Suchithra (2017). Summary statistics of temporal trend analysis (coefficient and R square) for socio- demographic and ecological variables, (p<0.05). [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001741620
    Explore at:
    Dataset updated
    Oct 2, 2017
    Authors
    Hu, Wenbiao; Akter, Rokeya; Tong, Shilu; Naish, Suchithra
    Description

    Summary statistics of temporal trend analysis (coefficient and R square) for socio- demographic and ecological variables, (p<0.05).

  18. q

    Module M.2 Descriptive statistics

    • qubeshub.org
    Updated Jun 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raisa Hernández-Pacheco; Alexandra Bland (2023). Module M.2 Descriptive statistics [Dataset]. http://doi.org/10.25334/NC22-Q397
    Explore at:
    Dataset updated
    Jun 26, 2023
    Dataset provided by
    QUBES
    Authors
    Raisa Hernández-Pacheco; Alexandra Bland
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Introduction to Primate Data Exploration and Linear Modeling with R was created with the goal of providing training to undergraduate biology students on data management and statistical analysis using authentic data of Cayo Santiago rhesus macaques. Module M.2 introduces basic functions in R, as well as in its packages tidyverse and rstatix, for estimating descriptive statistics.

  19. Chromosome 19 LD data for simulating summary statistics

    • search.datacite.org
    Updated May 30, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jean Morrison (2019). Chromosome 19 LD data for simulating summary statistics [Dataset]. http://doi.org/10.5281/zenodo.3235779
    Explore at:
    Dataset updated
    May 30, 2019
    Dataset provided by
    DataCitehttps://www.datacite.org/
    Zenodohttp://zenodo.org/
    Authors
    Jean Morrison
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This data set contains two files both of which contain R objects.

    chr19_snpdata_hm3only.RDS : A data frame with snp information

    evd_list_chr19_hm3.RDS : A list of eigen decomposition of the SNP correlation matrix spanning chromosome 19

    These data contain only SNPs in both 1k Genomes and HapMap3. Correlation matrices were estimated using LD Shrink. These data were built for use with the causeSims R package found here: https://github.com/jean997/causeSims

  20. 4

    TUD R Cafe Plot-a-thon: 4TU.ResearchData statistics

    • data.4tu.nl
    zip
    Updated Oct 11, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hyeokjin Kwon (2023). TUD R Cafe Plot-a-thon: 4TU.ResearchData statistics [Dataset]. http://doi.org/10.4121/7b8ae119-47b9-4759-9c1f-90f70f94ba73.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 11, 2023
    Dataset provided by
    4TU.ResearchData
    Authors
    Hyeokjin Kwon
    License

    https://www.gnu.org/licenses/gpl-3.0.htmlhttps://www.gnu.org/licenses/gpl-3.0.html

    Description

    This dataset is to visualize the 4TU.ResearchData resources for the plot-a-thon.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Enrico68 (2024). Using Descriptive Statistics to Analyse Data in R [Dataset]. https://www.kaggle.com/datasets/enrico68/using-descriptive-statistics-to-analyse-data-in-r
Organization logo

Using Descriptive Statistics to Analyse Data in R

Guided Project Learn how to calculate descriptive statistical metrics in order t

Explore at:
zip(105561 bytes)Available download formats
Dataset updated
May 9, 2024
Authors
Enrico68
Description

Load and view a real-world dataset in RStudio

• Calculate “Measure of Frequency” metrics

• Calculate “Measure of Central Tendency” metrics

• Calculate “Measure of Dispersion” metrics

• Use R’s in-built functions for additional data quality metrics

• Create a custom R function to calculate descriptive statistics on any given dataset

Search
Clear search
Close search
Google apps
Main menu