100+ datasets found
  1. f

    Letter-Value Plots: Boxplots for Large Data

    • tandf.figshare.com
    zip
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Heike Hofmann; Hadley Wickham; Karen Kafadar (2023). Letter-Value Plots: Boxplots for Large Data [Dataset]. http://doi.org/10.6084/m9.figshare.4748227.v3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Taylor & Francis
    Authors
    Heike Hofmann; Hadley Wickham; Karen Kafadar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Boxplots are useful displays that convey rough information about the distribution of a variable. Boxplots were designed to be drawn by hand and work best for small datasets, where detailed estimates of tail behavior beyond the quartiles may not be trustworthy. Larger datasets afford more precise estimates of tail behavior, but boxplots do not take advantage of this precision, instead presenting large numbers of extreme, though not unexpected, observations. Letter-value plots address this problem by including more detailed information about the tails using “letter values,” an order statistic defined by Tukey. Boxplots display the first two letter values (the median and quartiles); letter-value plots display further letter values so far as they are reliable estimates of their corresponding quantiles. We illustrate letter-value plots with real data that demonstrate their usefulness for large datasets. All graphics are created using the R package lvplot, and code and data are available in the supplementary materials.

  2. m

    Data from: Probability waves: adaptive cluster-based correction by...

    • data.mendeley.com
    • narcis.nl
    Updated Feb 8, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DIMITRI ABRAMOV (2021). Probability waves: adaptive cluster-based correction by convolution of p-value series from mass univariate analysis [Dataset]. http://doi.org/10.17632/rrm4rkr3xn.1
    Explore at:
    Dataset updated
    Feb 8, 2021
    Authors
    DIMITRI ABRAMOV
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    dataset and Octave/MatLab codes/scripts for data analysis Background: Methods for p-value correction are criticized for either increasing Type II error or improperly reducing Type I error. This problem is worse when dealing with thousands or even hundreds of paired comparisons between waves or images which are performed point-to-point. This text considers patterns in probability vectors resulting from multiple point-to-point comparisons between two event-related potentials (ERP) waves (mass univariate analysis) to correct p-values, where clusters of signiticant p-values may indicate true H0 rejection. New method: We used ERP data from normal subjects and other ones with attention deficit hyperactivity disorder (ADHD) under a cued forced two-choice test to study attention. The decimal logarithm of the p-vector (p') was convolved with a Gaussian window whose length was set as the shortest lag above which autocorrelation of each ERP wave may be assumed to have vanished. To verify the reliability of the present correction method, we realized Monte-Carlo simulations (MC) to (1) evaluate confidence intervals of rejected and non-rejected areas of our data, (2) to evaluate differences between corrected and uncorrected p-vectors or simulated ones in terms of distribution of significant p-values, and (3) to empirically verify rate of type-I error (comparing 10,000 pairs of mixed samples whit control and ADHD subjects). Results: the present method reduced the range of p'-values that did not show covariance with neighbors (type I and also type-II errors). The differences between simulation or raw p-vector and corrected p-vectors were, respectively, minimal and maximal for window length set by autocorrelation in p-vector convolution. Comparison with existing methods: Our method was less conservative while FDR methods rejected basically all significant p-values for Pz and O2 channels. The MC simulations, gold-standard method for error correction, presented 2.78±4.83% of difference (all 20 channels) from p-vector after correction, while difference between raw and corrected p-vector was 5,96±5.00% (p = 0.0003). Conclusion: As a cluster-based correction, the present new method seems to be biological and statistically suitable to correct p-values in mass univariate analysis of ERP waves, which adopts adaptive parameters to set correction.

  3. example 1 - time series - USD RUB 1 year data

    • kaggle.com
    Updated Sep 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Denis Andrikov (2024). example 1 - time series - USD RUB 1 year data [Dataset]. https://www.kaggle.com/datasets/denisandrikov/example-1-time-series-usd-rub-1-year-data/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 19, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Denis Andrikov
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    A simple table time series for school probability and statistics. We have to learn how to investigate data: value via time. What we try to do: - mean: average is the sum of all values divided by the number of values. It is also sometimes referred to as mean. - median is the middle number, when in order. Mode is the most common number. Range is the largest number minus the smallest number. - standard deviation s a measure of how dispersed the data is in relation to the mean.

  4. Big data analytics market value in Italy 2015-2020

    • statista.com
    Updated Jun 22, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2022). Big data analytics market value in Italy 2015-2020 [Dataset]. https://www.statista.com/statistics/1013064/big-data-analytics-market-value-in-italy/
    Explore at:
    Dataset updated
    Jun 22, 2022
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Italy
    Description

    According to the source's data, the market value of big data analytics in Italy increased steadily over the period considered, growing from 790 million euros in 2015 to approximately 1.8 billion euros in 2020.

  5. u

    Daily values

    • api.waterdata.usgs.gov
    Updated Mar 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Daily values [Dataset]. https://api.waterdata.usgs.gov/ogcapi/v0/collections?f=html
    Explore at:
    html, jsonld, json, application/schema+json, application/geo+jsonAvailable download formats
    Dataset updated
    Mar 4, 2025
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Description

    Daily data provide one data value to represent water conditions for the day. Throughout much of the history of the USGS, the primary water data available was daily data collected manually at the monitoring location once each day. With improved availability of computer storage and automated transmission of data, the daily data published today are generally a statistical summary or metric of the continuous data collected each day, such as the daily mean, minimum, or maximum value. Daily data are automatically calculated from the continuous data of the same parameter code and are described by parameter code and a statistic code. These data have also been referred to as “daily values” or “DV”.

  6. Data visualization market value worldwide 2017 and 2023

    • statista.com
    Updated May 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2022). Data visualization market value worldwide 2017 and 2023 [Dataset]. https://www.statista.com/statistics/1003906/worldwide-data-visualization-market-value/
    Explore at:
    Dataset updated
    May 23, 2022
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2017
    Area covered
    Worldwide
    Description

    This statistic shows the global data visualization market revenue in 2017 and 2023. In 2017, the total value of this market was estimated to be 4.51 billion US dollars. The market is expected to increase to 7.76 billion U.S. dollars by 2023, with a CAGR of 9.47 percent over the forecast period.

  7. Economic Census: Core Statistics: US Industry Product Data

    • catalog.data.gov
    Updated Jul 19, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Census Bureau (2023). Economic Census: Core Statistics: US Industry Product Data [Dataset]. https://catalog.data.gov/dataset/economic-census-core-statistics-us-industry-product-data
    Explore at:
    Dataset updated
    Jul 19, 2023
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Area covered
    United States
    Description

    The Economic Census is the U.S. Government's official five-year measure of American business and the economy. It is conducted by the U.S. Census Bureau, and response is required by law. In October through December of the census year, forms are sent out to nearly 4 million businesses, including large, medium and small companies representing all U.S. locations and industries. Respondents were asked to provide a range of operational and performance data for their companies. This dataset presents company, establishments, value of shipments, value of product shipments, percentage of product shipments of the total value of shipments, and percentage of distribution of value of product shipments.

  8. Data from: Pre-compiled metrics data sets, links to yearly statistics files...

    • doi.pangaea.de
    html, tsv
    Updated Sep 8, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin G Schultz; Sabine Schröder; Olga Lyapina; Owen R Cooper (2017). Pre-compiled metrics data sets, links to yearly statistics files in CSV format [Dataset]. http://doi.org/10.1594/PANGAEA.880505
    Explore at:
    tsv, htmlAvailable download formats
    Dataset updated
    Sep 8, 2017
    Dataset provided by
    PANGAEA
    Authors
    Martin G Schultz; Sabine Schröder; Olga Lyapina; Owen R Cooper
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1970 - Jan 1, 2015
    Variables measured
    DATE/TIME, File name, File size, Uniform resource locator/link to file
    Description

    Errata: On Dec 2nd, 2018, several yearly statistics files were replaced with new versions to correct an inconsistency related to the computation of the "dma8epax" statistics. As written in Schultz et al. (2017) [https://doi.org/10.1525/elementa.244], Supplement 1, Table 6: "When the aggregation period is “seasonal”, “summer”, or “annual”, the 4th highest daily 8-hour maximum of the aggregation period will be computed.". The data values for these aggregation periods are correct, however, the header information in the original files stated that the respective data column would contain "average daily maximum 8-hour ozone mixing ratio (nmol mol-1)". Therefore, the header of the seasonal, summer, and annual files has been corrected. Furthermore, the "dma8epax" column in the monthly files erroneously contained 4th highest daily maximum 8-hour average values, while it should have listed monthly average values instead. The data of this metric in the monthly files have therefore been replaced. The new column header reads "avgdma8epax". The updated files contain a version label "1.1" and a brief description of the error. If you have made use of previous TOAR data files with the "dma8epax" metric, please exchange your data files.

  9. d

    GLO climate data stats summary

    • data.gov.au
    • researchdata.edu.au
    • +1more
    zip
    Updated Apr 13, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bioregional Assessment Program (2022). GLO climate data stats summary [Dataset]. https://data.gov.au/data/dataset/afed85e0-7819-493d-a847-ec00a318e657
    Explore at:
    zip(8810)Available download formats
    Dataset updated
    Apr 13, 2022
    Dataset authored and provided by
    Bioregional Assessment Program
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract

    The dataset was derived by the Bioregional Assessment Programme from multiple source datasets. The source datasets are identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.

    Various climate variables summary for all 15 subregions based on Bureau of Meteorology Australian Water Availability Project (BAWAP) climate grids. Including

    1. Time series mean annual BAWAP rainfall from 1900 - 2012.

    2. Long term average BAWAP rainfall and Penman Potentail Evapotranspiration (PET) from Jan 1981 - Dec 2012 for each month

    3. Values calculated over the years 1981 - 2012 (inclusive), for 17 time periods (i.e., annual, 4 seasons and 12 months) for the following 8 meteorological variables: (i) BAWAP_P (precipitation); (ii) Penman ETp; (iii) Tavg (average temperature); (iv) Tmax (maximum temperature); (v) Tmin (minimum temperature); (vi) VPD (Vapour Pressure Deficit); (vii) Rn (net radiation); and (viii) Wind speed. For each of the 17 time periods for each of the 8 meteorological variables have calculated the: (a) average; (b) maximum; (c) minimum; (d) average plus standard deviation (stddev); (e) average minus stddev; (f) stddev; and (g) trend.

    4. Correlation coefficients (-1 to 1) between rainfall and 4 remote rainfall drivers between 1957-2006 for the four seasons. The data and methodology are described in Risbey et al. (2009).

    As described in the Risbey et al. (2009) paper, the rainfall was from 0.05 degree gridded data described in Jeffrey et al. (2001 - known as the SILO datasets); sea surface temperature was from the Hadley Centre Sea Ice and Sea Surface Temperature dataset (HadISST) on a 1 degree grid. BLK=Blocking; DMI=Dipole Mode Index; SAM=Southern Annular Mode; SOI=Southern Oscillation Index; DJF=December, January, February; MAM=March, April, May; JJA=June, July, August; SON=September, October, November. The analysis is a summary of Fig. 15 of Risbey et al. (2009).

    There are 4 csv files here:

    BAWAP_P_annual_BA_SYB_GLO.csv

    Desc: Time series mean annual BAWAP rainfall from 1900 - 2012.

    Source data: annual BILO rainfall

    P_PET_monthly_BA_SYB_GLO.csv

    long term average BAWAP rainfall and Penman PET from 198101 - 201212 for each month

    Climatology_Trend_BA_SYB_GLO.csv

    Values calculated over the years 1981 - 2012 (inclusive), for 17 time periods (i.e., annual, 4 seasons and 12 months) for the following 8 meteorological variables: (i) BAWAP_P; (ii) Penman ETp; (iii) Tavg; (iv) Tmax; (v) Tmin; (vi) VPD; (vii) Rn; and (viii) Wind speed. For each of the 17 time periods for each of the 8 meteorological variables have calculated the: (a) average; (b) maximum; (c) minimum; (d) average plus standard deviation (stddev); (e) average minus stddev; (f) stddev; and (g) trend

    Risbey_Remote_Rainfall_Drivers_Corr_Coeffs_BA_NSB_GLO.csv

    Correlation coefficients (-1 to 1) between rainfall and 4 remote rainfall drivers between 1957-2006 for the four seasons. The data and methodology are described in Risbey et al. (2009). As described in the Risbey et al. (2009) paper, the rainfall was from 0.05 degree gridded data described in Jeffrey et al. (2001 - known as the SILO datasets); sea surface temperature was from the Hadley Centre Sea Ice and Sea Surface Temperature dataset (HadISST) on a 1 degree grid. BLK=Blocking; DMI=Dipole Mode Index; SAM=Southern Annular Mode; SOI=Southern Oscillation Index; DJF=December, January, February; MAM=March, April, May; JJA=June, July, August; SON=September, October, November. The analysis is a summary of Fig. 15 of Risbey et al. (2009).

    Dataset History

    Dataset was created from various BAWAP source data, including Monthly BAWAP rainfall, Tmax, Tmin, VPD, etc, and other source data including monthly Penman PET, Correlation coefficient data. Data were extracted from national datasets for the GLO subregion.

    BAWAP_P_annual_BA_SYB_GLO.csv

    Desc: Time series mean annual BAWAP rainfall from 1900 - 2012.

    Source data: annual BILO rainfall

    P_PET_monthly_BA_SYB_GLO.csv

    long term average BAWAP rainfall and Penman PET from 198101 - 201212 for each month

    Climatology_Trend_BA_SYB_GLO.csv

    Values calculated over the years 1981 - 2012 (inclusive), for 17 time periods (i.e., annual, 4 seasons and 12 months) for the following 8 meteorological variables: (i) BAWAP_P; (ii) Penman ETp; (iii) Tavg; (iv) Tmax; (v) Tmin; (vi) VPD; (vii) Rn; and (viii) Wind speed. For each of the 17 time periods for each of the 8 meteorological variables have calculated the: (a) average; (b) maximum; (c) minimum; (d) average plus standard deviation (stddev); (e) average minus stddev; (f) stddev; and (g) trend

    Risbey_Remote_Rainfall_Drivers_Corr_Coeffs_BA_NSB_GLO.csv

    Correlation coefficients (-1 to 1) between rainfall and 4 remote rainfall drivers between 1957-2006 for the four seasons. The data and methodology are described in Risbey et al. (2009). As described in the Risbey et al. (2009) paper, the rainfall was from 0.05 degree gridded data described in Jeffrey et al. (2001 - known as the SILO datasets); sea surface temperature was from the Hadley Centre Sea Ice and Sea Surface Temperature dataset (HadISST) on a 1 degree grid. BLK=Blocking; DMI=Dipole Mode Index; SAM=Southern Annular Mode; SOI=Southern Oscillation Index; DJF=December, January, February; MAM=March, April, May; JJA=June, July, August; SON=September, October, November. The analysis is a summary of Fig. 15 of Risbey et al. (2009).

    Dataset Citation

    Bioregional Assessment Programme (2014) GLO climate data stats summary. Bioregional Assessment Derived Dataset. Viewed 18 July 2018, http://data.bioregionalassessments.gov.au/dataset/afed85e0-7819-493d-a847-ec00a318e657.

    Dataset Ancestors

  10. Amount of data created, consumed, and stored 2010-2023, with forecasts to...

    • statista.com
    Updated Jun 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Amount of data created, consumed, and stored 2010-2023, with forecasts to 2028 [Dataset]. https://www.statista.com/statistics/871513/worldwide-data-created/
    Explore at:
    Dataset updated
    Jun 30, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    May 2024
    Area covered
    Worldwide
    Description

    The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly, reaching *** zettabytes in 2024. Over the next five years up to 2028, global data creation is projected to grow to more than *** zettabytes. In 2020, the amount of data created and replicated reached a new high. The growth was higher than previously expected, caused by the increased demand due to the COVID-19 pandemic, as more people worked and learned from home and used home entertainment options more often. Storage capacity also growing Only a small percentage of this newly created data is kept though, as just * percent of the data produced and consumed in 2020 was saved and retained into 2021. In line with the strong growth of the data volume, the installed base of storage capacity is forecast to increase, growing at a compound annual growth rate of **** percent over the forecast period from 2020 to 2025. In 2020, the installed base of storage capacity reached *** zettabytes.

  11. d

    Annual General Canvass Statistics

    • catalog.data.gov
    • fisheries.noaa.gov
    Updated Jun 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact, Custodian) (2025). Annual General Canvass Statistics [Dataset]. https://catalog.data.gov/dataset/annual-general-canvass-statistics
    Explore at:
    Dataset updated
    Jun 1, 2025
    Dataset provided by
    (Point of Contact, Custodian)
    Description

    This data set contains annual quantities and value for all seafood products that are landed and sold by established seafood dealers and brokers in the Southeast Region (North Carolina through Texas). These types of data, referred to as the general canvass landings statistics, have been collected by the NOAA Fisheries Service, National Marine Fisheries Service and its predecessor agency, the Bureau of Commercial Fisheries. The data are available on computer since the early 1960's. The quantities and values that are reported in this data set include the annual landings that were initiated in 1962. Beginning in 1976, the data were collected monthly. See the sections on Links for the reference to the monthly general canvass landings. The annual general canvass landings include quantities and value for all living marine species and are identified by species (usually the local or common name). These data were collected by field agents employed by the National Marine Fisheries Service or the Bureau of Commercial Fisheries and assigned to local fishing ports. The agents contacted the majority of the seafood dealers or brokers in their assigned areas and recorded the quantities and value for each species or species category from the sales receipts maintained by the seafood dealers. In addition, information on the gear and area of capture is available for most of the landings statistics in the data set. Based on their knowledge of the fishing activity in the area, the agents would estimate the type of fishing gear and area where the fishing was likely to have occurred. More detailed information on the caveats associated with these data is provided in the Characteristics, Caveats and Issues section. However, because these data are summaries, they do not contain information on the quantities of fishing effort or identifications of the fishermen or vessels that caught the fish or shellfish.

  12. s

    Swedish High Value Data Collection: Companies, Geospatial, Meteorological,...

    • store.smartdatahub.io
    Updated Aug 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Swedish High Value Data Collection: Companies, Geospatial, Meteorological, Statistics, and Earth Observation & Environment - Datasets - This service has been deprecated - please visit https://www.smartdatahub.io/ to access data. See the About page for details. // [Dataset]. https://store.smartdatahub.io/dataset/se_lantmateriet_bilaga_1_sweden_proposal_on_high_value_data_20200430_xlsx
    Explore at:
    Dataset updated
    Aug 26, 2024
    Area covered
    Sweden, Earth
    Description

    The dataset collection in focus comprises an assortment of tables, each carrying a distinct set of data. These tables are meticulously sourced from the website of Lantmäteriet (The Swedish Mapping, Cadastral and Land Registration Authority) in Sweden. The dataset provides a wide range of valuable data, including but not limited to, information about companies, geospatial data, meteorological data, statistical data, and earth observation & environmental data. The tables present the data in an organized manner, with the information arranged systematically in columns and rows. This makes it convenient to analyze and draw insights from the dataset. Overall, it's a comprehensive dataset collection that offers a diverse and substantial range of information.

  13. Rural statistics local level data on tourism Gross Value Added (GVA), 2001...

    • gov.uk
    Updated Aug 8, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Environment, Food & Rural Affairs (2013). Rural statistics local level data on tourism Gross Value Added (GVA), 2001 to 2010 [Dataset]. https://www.gov.uk/government/statistical-data-sets/rural-statistics-local-level-data-on-tourism-gva
    Explore at:
    Dataset updated
    Aug 8, 2013
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Environment, Food & Rural Affairs
    Description

    Gross Value Added (GVA) of tourism related industries by Local Authority Rural-Urban Classification, region and local authority for the period from 2001 to 2010.

    GVA measures the contribution to the economy of each individual producer, industry or sector. These statistics are produced for tourism activity in England.

    The definition of tourism used is the http://www.ons.gov.uk/ons/guide-method/method-quality/specific/economy/economic-value-of-tourism/measuring-tourism-locally/2012/note-1/index.html" class="govuk-link">UN WTO definition.

    Source: Office for National Statistics, Annual Business Survey (ABS) and Local Authority Rural-Urban Classification.

    https://assets.publishing.service.gov.uk/media/5a7c7193e5274a5255bceae8/Tourism_NUTS4_Final_v3.xlsx">Tourism GVA

    MS Excel Spreadsheet, 273 KB

    This file may not be suitable for users of assistive technology.

    Request an accessible format.
    If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email defra.helpline@defra.gov.uk. Please tell us what format you need. It will help us if you say what assistive technology you use.

    Defra statistics: rural

    Email mailto:rural.statistics@defra.gov.uk">rural.statistics@defra.gov.uk

    <p class="govuk-body">You can also contact us via Twitter: <a href="https://twitter.com/DefraStats" class="govuk-link">https://twitter.com/DefraStats</a></p>
    

  14. Principal Statistics Of Manufacturing By Industry Cluster - Value Added,...

    • data.gov.sg
    Updated Jun 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Singapore Department of Statistics (2025). Principal Statistics Of Manufacturing By Industry Cluster - Value Added, Annual [Dataset]. https://data.gov.sg/datasets/d_5300314b2a47151f85b1786e449e860c/view
    Explore at:
    Dataset updated
    Jun 13, 2025
    Dataset authored and provided by
    Singapore Department of Statistics
    License

    https://data.gov.sg/open-data-licencehttps://data.gov.sg/open-data-licence

    Time period covered
    Jan 2000 - Dec 2024
    Description

    Dataset from Singapore Department of Statistics. For more information, visit https://data.gov.sg/datasets/d_5300314b2a47151f85b1786e449e860c/view

  15. J

    Replication Data - Comparison of Gross Value Added and Production

    • journaldata.zbw.eu
    pdf, xls
    Updated Mar 3, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabe de Bondt; Stanimira Vasileva Kosekova; Gabe de Bondt; Stanimira Vasileva Kosekova (2021). Replication Data - Comparison of Gross Value Added and Production [Dataset]. http://doi.org/10.15456/jbnst.2018299.083130
    Explore at:
    pdf, xlsAvailable download formats
    Dataset updated
    Mar 3, 2021
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Gabe de Bondt; Stanimira Vasileva Kosekova; Gabe de Bondt; Stanimira Vasileva Kosekova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Euro Area Growth Signals from Industrial Production: Warnings from a Comparison of Gross Value Added and Production

    This study compares industrial production and gross value added in volume terms in the euro area and euro area countries, because real GDP growth
    signals from industrial production growth might be misleading and earlier released industrial production growth is not one-to-one translated into industrial value added growth. This is an important issue for analysts and policy makers, because industrial production is a standard element in tools for nowcasting real GDP in real time. It also raises the question about the factors explaining these differences. Differences in terms of (changes in) quarterly growth between production and gross value added include sign reversals and can last for consecutive quarters. Persistent level differences might also exist. The explanatory factors for these differences are the treatment of prices, seasonality and coverage. Data limitations prevent a detailed analysis of the price factor, but the other two factors are more closely evaluated. It turns out that the relative importance of these factors varies over time and thus is difficult to assess ex ante for a specific quarter. A remedy is that statisticians further harmonize national accounts and short-term statistics as well as national practices for seasonal adjustment.

  16. d

    International Data Base

    • dknet.org
    • rrid.site
    • +2more
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). International Data Base [Dataset]. http://identifiers.org/RRID:SCR_013139
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    A computerized data set of demographic, economic and social data for 227 countries of the world. Information presented includes population, health, nutrition, mortality, fertility, family planning and contraceptive use, literacy, housing, and economic activity data. Tabular data are broken down by such variables as age, sex, and urban/rural residence. Data are organized as a series of statistical tables identified by country and table number. Each record consists of the data values associated with a single row of a given table. There are 105 tables with data for 208 countries. The second file is a note file, containing text of notes associated with various tables. These notes provide information such as definitions of categories (i.e. urban/rural) and how various values were calculated. The IDB was created in the U.S. Census Bureau''s International Programs Center (IPC) to help IPC staff meet the needs of organizations that sponsor IPC research. The IDB provides quick access to specialized information, with emphasis on demographic measures, for individual countries or groups of countries. The IDB combines data from country sources (typically censuses and surveys) with IPC estimates and projections to provide information dating back as far as 1950 and as far ahead as 2050. Because the IDB is maintained as a research tool for IPC sponsor requirements, the amount of information available may vary by country. As funding and research activity permit, the IPC updates and expands the data base content. Types of data include: * Population by age and sex * Vital rates, infant mortality, and life tables * Fertility and child survivorship * Migration * Marital status * Family planning Data characteristics: * Temporal: Selected years, 1950present, projected demographic data to 2050. * Spatial: 227 countries and areas. * Resolution: National population, selected data by urban/rural * residence, selected data by age and sex. Sources of data include: * U.S. Census Bureau * International projects (e.g., the Demographic and Health Survey) * United Nations agencies Links: * ICPSR: http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/08490

  17. f

    Appendix A. Equivalence of the F test and t test in Example 1.

    • wiley.figshare.com
    html
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paul A. Murtaugh (2023). Appendix A. Equivalence of the F test and t test in Example 1. [Dataset]. http://doi.org/10.6084/m9.figshare.3527090.v1
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    Wiley
    Authors
    Paul A. Murtaugh
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Equivalence of the F test and t test in Example 1.

  18. H

    Data from: Reply on the comments on When is Statistical Significance not...

    • dataverse.harvard.edu
    tsv
    Updated Aug 27, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Dataverse (2014). Reply on the comments on When is Statistical Significance not Significant? [Dataset]. http://doi.org/10.7910/DVN/27162
    Explore at:
    tsv(4252), tsv(116875), tsv(24666), tsv(1720), tsv(27015), tsv(16144)Available download formats
    Dataset updated
    Aug 27, 2014
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    In this article we reply to Silva and Guarnieri (2014) comments on Figueiredo Filho et al (2013) paper published by the Brazilian Political Science Review. Originally, we developed four recommendations: (1) scholars must always graphically analyze their data before interpreting the p-value; (2) it is pointless to estimate the p-value for non-random samples; (3) the p-value is highly affected by the sample size and (4) it is pointless to estimate the p-value when dealing with data on population. Here we defend our view about the proper use the p-value statistic and we use both observational and simulation data to make our case. Substantively, we hope to advance the debate about statistical significance in Political Science.

  19. Vehicle licensing statistics data files

    • gov.uk
    • s3.amazonaws.com
    Updated Jun 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Transport (2025). Vehicle licensing statistics data files [Dataset]. https://www.gov.uk/government/statistical-data-sets/vehicle-licensing-statistics-data-files
    Explore at:
    Dataset updated
    Jun 11, 2025
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Transport
    Description

    Recent changes

    A number of changes were introduced to these data files in the 2022 release to help meet the needs of our users and to provide more detail.

    Fuel type has been added to:

    • df_VEH0120_GB
    • df_VEH0120_UK
    • df_VEH0160_GB
    • df_VEH0160_UK

    Historic UK data has been added to:

    • df_VEH0124 (now split into 2 files)
    • df_VEH0220
    • df_VEH0270

    A new datafile has been added df_VEH0520.

    We welcome any feedback on the structure of our data files, their usability, or any suggestions for improvements; please contact vehicles statistics.

    How to use CSV files

    CSV files can be used either as a spreadsheet (using Microsoft Excel or similar spreadsheet packages) or digitally using software packages and languages (for example, R or Python).

    When using as a spreadsheet, there will be no formatting, but the file can still be explored like our publication tables. Due to their size, older software might not be able to open the entire file.

    Download data files

    Make and model by quarter

    df_VEH0120_GB: https://assets.publishing.service.gov.uk/media/68494aca74fe8fe0cbb4676c/df_VEH0120_GB.csv">Vehicles at the end of the quarter by licence status, body type, make, generic model and model: Great Britain (CSV, 58.1 MB)

    Scope: All registered vehicles in Great Britain; from 1994 Quarter 4 (end December)

    Schema: BodyType, Make, GenModel, Model, Fuel, LicenceStatus, [number of vehicles; 1 column per quarter]

    df_VEH0120_UK: https://assets.publishing.service.gov.uk/media/68494acb782e42a839d3a3ac/df_VEH0120_UK.csv">Vehicles at the end of the quarter by licence status, body type, make, generic model and model: United Kingdom (CSV, 34.1 MB)

    Scope: All registered vehicles in the United Kingdom; from 2014 Quarter 3 (end September)

    Schema: BodyType, Make, GenModel, Model, Fuel, LicenceStatus, [number of vehicles; 1 column per quarter]

    df_VEH0160_GB: https://assets.publishing.service.gov.uk/media/68494ad774fe8fe0cbb4676d/df_VEH0160_GB.csv">Vehicles registered for the first time by body type, make, generic model and model: Great Britain (CSV, 24.8 MB)

    Scope: All vehicles registered for the first time in Great Britain; from 2001 Quarter 1 (January to March)

    Schema: BodyType, Make, GenModel, Model, Fuel, [number of vehicles; 1 column per quarter]

    df_VEH0160_UK: https://assets.publishing.service.gov.uk/media/68494ad7aae47e0d6c06e078/df_VEH0160_UK.csv">Vehicles registered for the first time by body type, make, generic model and model: United Kingdom (CSV, 8.26 MB)

    Scope: All vehicles registered for the first time in the United Kingdom; from 2014 Quarter 3 (July to September)

    Schema: BodyType, Make, GenModel, Model, Fuel, [number of vehicles; 1 column per quarter]

    Make and model by age

    In order to keep the datafile df_VEH0124 to a reasonable size, it has been split into 2 halves; 1 covering makes starting with A to M, and the other covering makes starting with N to Z.

    df_VEH0124_AM: <a class="govuk-link" href="https://assets.

  20. F

    Business Sector: Value-Added Output for All Workers

    • fred.stlouisfed.org
    json
    Updated May 8, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Business Sector: Value-Added Output for All Workers [Dataset]. https://fred.stlouisfed.org/series/PRS84006051
    Explore at:
    jsonAvailable download formats
    Dataset updated
    May 8, 2025
    License

    https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain

    Description

    Graph and download economic data for Business Sector: Value-Added Output for All Workers (PRS84006051) from Q1 1948 to Q1 2025 about current dollars, value added, output, sector, business, rate, and USA.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Heike Hofmann; Hadley Wickham; Karen Kafadar (2023). Letter-Value Plots: Boxplots for Large Data [Dataset]. http://doi.org/10.6084/m9.figshare.4748227.v3

Letter-Value Plots: Boxplots for Large Data

Related Article
Explore at:
zipAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
Taylor & Francis
Authors
Heike Hofmann; Hadley Wickham; Karen Kafadar
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Boxplots are useful displays that convey rough information about the distribution of a variable. Boxplots were designed to be drawn by hand and work best for small datasets, where detailed estimates of tail behavior beyond the quartiles may not be trustworthy. Larger datasets afford more precise estimates of tail behavior, but boxplots do not take advantage of this precision, instead presenting large numbers of extreme, though not unexpected, observations. Letter-value plots address this problem by including more detailed information about the tails using “letter values,” an order statistic defined by Tukey. Boxplots display the first two letter values (the median and quartiles); letter-value plots display further letter values so far as they are reliable estimates of their corresponding quantiles. We illustrate letter-value plots with real data that demonstrate their usefulness for large datasets. All graphics are created using the R package lvplot, and code and data are available in the supplementary materials.

Search
Clear search
Close search
Google apps
Main menu