30 datasets found
  1. f

    Letter-Value Plots: Boxplots for Large Data

    • tandf.figshare.com
    zip
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Heike Hofmann; Hadley Wickham; Karen Kafadar (2023). Letter-Value Plots: Boxplots for Large Data [Dataset]. http://doi.org/10.6084/m9.figshare.4748227.v3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Taylor & Francis
    Authors
    Heike Hofmann; Hadley Wickham; Karen Kafadar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Boxplots are useful displays that convey rough information about the distribution of a variable. Boxplots were designed to be drawn by hand and work best for small datasets, where detailed estimates of tail behavior beyond the quartiles may not be trustworthy. Larger datasets afford more precise estimates of tail behavior, but boxplots do not take advantage of this precision, instead presenting large numbers of extreme, though not unexpected, observations. Letter-value plots address this problem by including more detailed information about the tails using “letter values,” an order statistic defined by Tukey. Boxplots display the first two letter values (the median and quartiles); letter-value plots display further letter values so far as they are reliable estimates of their corresponding quantiles. We illustrate letter-value plots with real data that demonstrate their usefulness for large datasets. All graphics are created using the R package lvplot, and code and data are available in the supplementary materials.

  2. b

    Lower quartile house price (affordability ratios) - WMCA

    • cityobservatory.birmingham.gov.uk
    csv, excel, geojson +1
    Updated Jul 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Lower quartile house price (affordability ratios) - WMCA [Dataset]. https://cityobservatory.birmingham.gov.uk/explore/dataset/lower-quartile-house-price-affordability-ratios-wmca/
    Explore at:
    csv, excel, geojson, jsonAvailable download formats
    Dataset updated
    Jul 3, 2025
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This is the unadjusted lower quartile house priced for residential property sales (transactions) in the area for a 12 month period with April in the middle (year-ending September). These figures have been produced by the ONS (Office for National Statistics) using the Land Registry (LR) Price Paid data on residential dwelling transactions.

    The LR Price Paid data are comprehensive in that they capture changes of ownership for individual residential properties which have sold for full market value and covers both cash sales and those involving a mortgage.

    The lower quartile is the value determined by putting all the house sales for a given year, area and type in order of price and then selecting the price of the house sale which falls three quarters of the way down the list, such that 75Percentage of transactions lie above and 25Percentage lie below that value. These are particularly useful for assessing housing affordability when viewed alongside average and lower quartile income for given areas.

    Note that a transaction occurs when a change of freeholder or leaseholder takes place regardless of the amount of money involved and a property can transact more than once in the time period.

    The LR records the actual price for which the property changed hands. This will usually be an accurate reflection of the market value for the individual property, but it is not always the case. In order to generate statistics that more accurately reflect market values, the LR has excluded records of houses that were not sold at market value from the dataset. The remaining data are considered a good reflection of market values at the time of the transaction. For full details of exclusions and more information on the methodology used to produce these statistics please see http://www.ons.gov.uk/peoplepopulationandcommunity/housing/qmis/housepricestatisticsforsmallareasqmi

    The LR Price Paid data are not adjusted to reflect the mix of houses in a given area. Fluctuations in the types of house that are sold in that area can cause differences between the lower quartile transactional value of houses and the overall market value of houses.

    If, for a given year, for house type and area there were fewer than 5 sales records in the LR Price Paid data, the house price statistics are not reported." Data is Powered by LG Inform Plus and automatically checked for new data on the 3rd of each month.

  3. Data from: Journal Ranking Dataset

    • kaggle.com
    Updated Aug 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abir (2023). Journal Ranking Dataset [Dataset]. https://www.kaggle.com/datasets/xabirhasan/journal-ranking-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 15, 2023
    Dataset provided by
    Kaggle
    Authors
    Abir
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Journals & Ranking

    An academic journal or research journal is a periodical publication in which research articles relating to a particular academic discipline is published, according to Wikipedia. Currently, there are more than 25,000 peer-reviewed journals that are indexed in citation index databases such as Scopus and Web of Science. These indexes are ranked on the basis of various metrics such as CiteScore, H-index, etc. The metrics are calculated from yearly citation data of the journal. A lot of efforts are given to make a metric that reflects the journal's quality.

    Journal Ranking Dataset

    This is a comprehensive dataset on the academic journals coving their metadata information as well as citation, metrics, and ranking information. Detailed data on their subject area is also given in this dataset. The dataset is collected from the following indexing databases: - Scimago Journal Ranking - Scopus - Web of Science Master Journal List

    The data is collected by scraping and then it was cleaned, details of which can be found in HERE.

    Key Features

    • Rank: Overall rank of journal (derived from sorted SJR index).
    • Title: Name or title of journal.
    • OA: Open Access or not.
    • Country: Country of origin.
    • SJR-index: A citation index calculated by Scimago.
    • CiteScore: A citation index calculated by Scopus.
    • H-index: Hirsh index, the largest number h such that at least h articles in that journal were cited at least h times each.
    • Best Quartile: Top Q-index or quartile a journal has in any subject area.
    • Best Categories: Subject areas with top quartile.
    • Best Subject Area: Highest ranking subject area.
    • Best Subject Rank: Rank of the highest ranking subject area.
    • Total Docs.: Total number of documents of the journal.
    • Total Docs. 3y: Total number of documents in the past 3 years.
    • Total Refs.: Total number of references of the journal.
    • Total Cites 3y: Total number of citations in the past 3 years.
    • Citable Docs. 3y: Total number of citable documents in the past 3 years.
    • Cites/Doc. 2y: Total number of citations divided by the total number of documents in the past 2 years.
    • Refs./Doc.: Total number of references divided by the total number of documents.
    • Publisher: Name of the publisher company of the journal.
    • Core Collection: Web of Science core collection name.
    • Coverage: Starting year of coverage.
    • Active: Active or inactive.
    • In-Press: Articles in press or not.
    • ISO Language Code: Three-letter ISO 639 code for language.
    • ASJC Codes: All Science Journal Classification codes for the journal.

    Rest of the features provide further details on the journal's subject area or category: - Life Sciences: Top level subject area. - Social Sciences: Top level subject area. - Physical Sciences: Top level subject area. - Health Sciences: Top level subject area. - 1000 General: ASJC main category. - 1100 Agricultural and Biological Sciences: ASJC main category. - 1200 Arts and Humanities: ASJC main category. - 1300 Biochemistry, Genetics and Molecular Biology: ASJC main category. - 1400 Business, Management and Accounting: ASJC main category. - 1500 Chemical Engineering: ASJC main category. - 1600 Chemistry: ASJC main category. - 1700 Computer Science: ASJC main category. - 1800 Decision Sciences: ASJC main category. - 1900 Earth and Planetary Sciences: ASJC main category. - 2000 Economics, Econometrics and Finance: ASJC main category. - 2100 Energy: ASJC main category. - 2200 Engineering: ASJC main category. - 2300 Environmental Science: ASJC main category. - 2400 Immunology and Microbiology: ASJC main category. - 2500 Materials Science: ASJC main category. - 2600 Mathematics: ASJC main category. - 2700 Medicine: ASJC main category. - 2800 Neuroscience: ASJC main category. - 2900 Nursing: ASJC main category. - 3000 Pharmacology, Toxicology and Pharmaceutics: ASJC main category. - 3100 Physics and Astronomy: ASJC main category. - 3200 Psychology: ASJC main category. - 3300 Social Sciences: ASJC main category. - 3400 Veterinary: ASJC main category. - 3500 Dentistry: ASJC main category. - 3600 Health Professions: ASJC main category.

  4. D

    Report Card Administrators by School Poverty Quartile School Years 2017-18...

    • data.wa.gov
    • catalog.data.gov
    application/rdfxml +5
    Updated Jan 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OSPI (2025). Report Card Administrators by School Poverty Quartile School Years 2017-18 to 2023-24 [Dataset]. https://data.wa.gov/resource/fhnj-yqpr
    Explore at:
    csv, json, application/rssxml, xml, tsv, application/rdfxmlAvailable download formats
    Dataset updated
    Jan 16, 2025
    Dataset authored and provided by
    OSPI
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This file includes Report Card administrator experience status by school poverty quartile data for the 2017-18 through 2023-24 school years. Data is disaggregated by state, ESD, LEA, and school level. Please review the notes below for more information.

  5. Speed Test API - Minimum and Maximum Quartile speeds by Geography Type

    • catalog.data.gov
    • data.amerigeoss.org
    Updated Mar 11, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Telecommunication and Information Administration, Department of Commerce (2021). Speed Test API - Minimum and Maximum Quartile speeds by Geography Type [Dataset]. https://catalog.data.gov/dataset/speed-test-api-minimum-and-maximum-quartile-speeds-by-geography-type
    Explore at:
    Dataset updated
    Mar 11, 2021
    Dataset provided by
    United States Department of Commercehttp://www.commerce.gov/
    Description

    This API returns the minimum and maximum quartile speeds by geography type within the nation.

  6. COVID-19 Vaccine Progress Dashboard Data by ZIP Code

    • data.chhs.ca.gov
    • data.ca.gov
    • +2more
    csv, xlsx, zip
    Updated Jul 4, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Public Health (2025). COVID-19 Vaccine Progress Dashboard Data by ZIP Code [Dataset]. https://data.chhs.ca.gov/dataset/covid-19-vaccine-progress-dashboard-data-by-zip-code
    Explore at:
    csv(5478164), xlsx(7800), csv(27663424), csv(21567128), csv(9320174), xlsx(10933), zipAvailable download formats
    Dataset updated
    Jul 4, 2025
    Dataset authored and provided by
    California Department of Public Healthhttps://www.cdph.ca.gov/
    Description

    Note: In these datasets, a person is defined as up to date if they have received at least one dose of an updated COVID-19 vaccine. The Centers for Disease Control and Prevention (CDC) recommends that certain groups, including adults ages 65 years and older, receive additional doses.

    Starting on July 13, 2022, the denominator for calculating vaccine coverage has been changed from age 5+ to all ages to reflect new vaccine eligibility criteria. Previously the denominator was changed from age 16+ to age 12+ on May 18, 2021, then changed from age 12+ to age 5+ on November 10, 2021, to reflect previous changes in vaccine eligibility criteria. The previous datasets based on age 12+ and age 5+ denominators have been uploaded as archived tables.

    Starting June 30, 2021, the dataset has been reconfigured so that all updates are appended to one dataset to make it easier for API and other interfaces. In addition, historical data has been extended back to January 5, 2021.

    This dataset shows full, partial, and at least 1 dose coverage rates by zip code tabulation area (ZCTA) for the state of California. Data sources include the California Immunization Registry and the American Community Survey’s 2015-2019 5-Year data.

    This is the data table for the LHJ Vaccine Equity Performance dashboard. However, this data table also includes ZTCAs that do not have a VEM score.

    This dataset also includes Vaccine Equity Metric score quartiles (when applicable), which combine the Public Health Alliance of Southern California’s Healthy Places Index (HPI) measure with CDPH-derived scores to estimate factors that impact health, like income, education, and access to health care. ZTCAs range from less healthy community conditions in Quartile 1 to more healthy community conditions in Quartile 4.

    The Vaccine Equity Metric is for weekly vaccination allocation and reporting purposes only. CDPH-derived quartiles should not be considered as indicative of the HPI score for these zip codes. CDPH-derived quartiles were assigned to zip codes excluded from the HPI score produced by the Public Health Alliance of Southern California due to concerns with statistical reliability and validity in populations smaller than 1,500 or where more than 50% of the population resides in a group setting.

    These data do not include doses administered by the following federal agencies who received vaccine allocated directly from CDC: Indian Health Service, Veterans Health Administration, Department of Defense, and the Federal Bureau of Prisons.

    For some ZTCAs, vaccination coverage may exceed 100%. This may be a result of many people from outside the county coming to that ZTCA to get their vaccine and providers reporting the county of administration as the county of residence, and/or the DOF estimates of the population in that ZTCA are too low. Please note that population numbers provided by DOF are projections and so may not be accurate, especially given unprecedented shifts in population as a result of the pandemic.

  7. Data set - Measured in a context : making sense of open access book data

    • zenodo.org
    • data.niaid.nih.gov
    Updated Jul 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ronald Snijder; Ronald Snijder (2023). Data set - Measured in a context : making sense of open access book data [Dataset]. http://doi.org/10.5281/zenodo.8118120
    Explore at:
    Dataset updated
    Jul 6, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Ronald Snijder; Ronald Snijder
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    For more than a decade, open access book platforms have been distributing titles in order to maximise their impact. Each platform offers some form of usage data, showcasing the success of their offering. However, the numbers alone are not sufficient to convey how well a book is actually performing.

    Our data set is consists of 18,014 books and chapters. The selected titles have been added to the OAPEN Library collection before 1 January 2022, and the usage data of twelve months (January to December 2022) has been captured. During that period, this collection of books and chapters has been downloaded more than 10 million times. Each title has been linked to one broad subject and the title’s language has been coded as either English, German or other languages.

    The titles are rated using the TOANI score.

    The acronym stands for Transparent Open Access Normalised Index. The transparency is based on the application of clear regulations, and by making all data used visible. The data is normalised, by using a common scale for the complete collection of an open access book platform. Additionally, there are only three possible values to score the titles: average, less than average and more than average. This index is set up to provide a clear and simple answer to the question whether an open access book has made an impact. It is not meant to give a sense of false accuracy; the complexities surrounding this issue cannot be measured in several decimal places.

    The TOANI score is based on the following principles:

    • Select only titles that have been available for at least 12 months;
    • Use the usage data of the same 12 months period for the whole collection;
    • Each title is assigned one – high level – subject;
    • Each title is assigned one language;
    • All titles are grouped based on subject and language;
    • The groups should consists of at least 100 titles;
    • The following data must be made available for each title:
      • Platform
      • Total number of titles in the group
      • Subject
      • Language
      • Period used for the measurement
      • Minimum value, maximum value, median, first and third quartile of the platform’s usage data
    • Based on the previous, titles are classified as:
      • “Less than average” – First quartile; 25 % of the titles
      • “Average” – Second and third quartile; 50% of the titles
      • “More than average” – Fourth quartile; 25 % of the titles
  8. A

    COVID-19 Vaccine Progress Dashboard Data

    • data.amerigeoss.org
    • healthdata.gov
    • +5more
    csv, xls, zip
    Updated Jul 27, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States (2022). COVID-19 Vaccine Progress Dashboard Data [Dataset]. https://data.amerigeoss.org/es/dataset/59814d82-8b5f-4acb-b273-0577326dcebc
    Explore at:
    zip, xls, csvAvailable download formats
    Dataset updated
    Jul 27, 2022
    Dataset provided by
    United States
    Description

    Note: Starting on July 13, 2022, the denominator for calculating vaccine coverage has been changed from age 5+ to all ages to reflect new vaccine eligibility criteria. Previously the denominator was changed from age 16+ to age 12+ on May 18, 2021, then changed from age 12+ to age 5+ on November 10, 2021, to reflect previous changes in vaccine eligibility criteria. The previous datasets based on age 16+ and age 5+ denominators have been uploaded as archived tables.

    Note: Starting on May 29, 2021 the methodology for calculating on-hand inventory in the shipped/delivered/on-hand dataset has changed. Please see the accompanying data dictionary for details. In addition, this dataset is now down to the ZIP code level.

    This data is from the same source as the Vaccine Progress Dashboard at https://covid19.ca.gov/vaccination-progress-data/ which summarizes vaccination data at the county level by county of residence. Where county of residence was not reported in a vaccination record, the county of provider that vaccinated the resident is included. This applies to less than 1% of vaccination records. The sum of county-level vaccinations does not equal statewide total vaccinations due to out-of-state residents vaccinated in California.

    This dataset also includes Vaccine Equity Metric score quartiles, which combine the Public Health Alliance of Southern California’s Healthy Places Index (HPI) measure with CDPH-derived scores to estimate factors that impact health, like income, education, and access to health care. Zip codes range from less healthy community conditions in Quartile 1 to more healthy community conditions in Quartile 4.

    The Vaccine Equity Metric is for weekly vaccination allocation and reporting purposes only. CDPH-derived quartiles should not be considered as indicative of the HPI score for these zip codes. CDPH-derived quartiles were assigned to zip codes excluded from the HPI score produced by the Public Health Alliance of Southern California due to concerns with statistical reliability and validity in populations smaller than 1,500 or where more than 50% of the population resides in a group setting.

    These data do not include doses administered by the following federal agencies who received vaccine allocated directly from CDC: Indian Health Service, Veterans Health Administration, Department of Defense, and the Federal Bureau of Prisons.

    Note: Totals for the Vaccine Progress Dashboard and this dataset may not match, as the Dashboard totals doses by Report Date and this dataset totals doses by Administration Date. Dose numbers may also change for a particular Administration Date as data is updated.

  9. A

    ‘COVID-19 Vaccine Progress Dashboard Data’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Jan 27, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘COVID-19 Vaccine Progress Dashboard Data’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/data-gov-covid-19-vaccine-progress-dashboard-data-0362/2d0c168f/?iid=000-163&v=presentation
    Explore at:
    Dataset updated
    Jan 27, 2022
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘COVID-19 Vaccine Progress Dashboard Data’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/b35f49ea-75f8-43e3-9b99-fd4f5890be97 on 27 January 2022.

    --- Dataset description provided by original source is as follows ---

    Note: Starting on November 10, 2021, columns have been added for age 5+ denominator for calculating vaccine coverage to reflect new vaccine eligibility criteria.

    In May 18, 2021, the denominator for calculating vaccine coverage has been changed from age 16+ to age 12+ to reflect new vaccine eligibility criteria. The previous dataset based on age 16+ denominators has been uploaded as an archived table.

    Starting on May 29, 2021 the methodology for calculating on-hand inventory in the shipped/delivered/on-hand dataset has changed. Please see the accompanying data dictionary for details. In addition, this dataset is now down to the ZIP code level.

    This data is from the same source as the Vaccine Progress Dashboard at https://covid19.ca.gov/vaccines/ which summarizes vaccination data at the county level by county of residence. Where county of residence was not reported in a vaccination record, the county of provider that vaccinated the resident is included. This applies to less than 1% of vaccination records. The sum of county-level vaccinations does not equal statewide total vaccinations due to out-of-state residents vaccinated in California.

    This dataset also includes Vaccine Equity Metric score quartiles, which combine the Public Health Alliance of Southern California’s Healthy Places Index (HPI) measure with CDPH-derived scores to estimate factors that impact health, like income, education, and access to health care. Zip codes range from less healthy community conditions in Quartile 1 to more healthy community conditions in Quartile 4.

    The Vaccine Equity Metric is for weekly vaccination allocation and reporting purposes only. CDPH-derived quartiles should not be considered as indicative of the HPI score for these zip codes. CDPH-derived quartiles were assigned to zip codes excluded from the HPI score produced by the Public Health Alliance of Southern California due to concerns with statistical reliability and validity in populations smaller than 1,500 or where more than 50% of the population resides in a group setting.

    These data do not include doses administered by the following federal agencies who received vaccine allocated directly from CDC: Indian Health Service, Veterans Health Administration, Department of Defense, and the Federal Bureau of Prisons.

    Note: Totals for the Vaccine Progress Dashboard and this dataset may not match, as the Dashboard totals doses by Report Date and this dataset totals doses by Administration Date. Dose numbers may also change for a particular Administration Date as data is updated.

    --- Original source retains full ownership of the source dataset ---

  10. a

    ABS - Index of Household Advantage and Disadvantage (IHAD) (SA1) 2016 -...

    • data.aurin.org.au
    Updated Mar 5, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). ABS - Index of Household Advantage and Disadvantage (IHAD) (SA1) 2016 - Dataset - AURIN [Dataset]. https://data.aurin.org.au/dataset/au-govt-abs-abs-ihad-sa1-2016-sa1-2016
    Explore at:
    Dataset updated
    Mar 5, 2025
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset presents information from 2016 at the household level; the percentage of households within each Index of Household Advantage and Disadvantage (IHAD) quartile for Statistical Area Level 1 (SA1) 2016 boundaries. The IHAD is an experimental analytical index developed by the Australian Bureau of Statistics (ABS) that provides a summary measure of relative socio-economic advantage and disadvantage for households. It utilises information from the 2016 Census of Population and Housing. IHAD quartiles: All households are ordered from lowest to highest disadvantage, the lowest 25% of households are given a quartile number of 1, the next lowest 25% of households are given a quartile number of 2 and so on, up to the highest 25% of households which are given a quartile number of 4. This means that households are divided up into four groups, depending on their score. This data is ABS data (catalogue number: 4198.0) used with permission from the Australian Bureau of Statistics. For more information please visit the Australian Bureau of Statistics. Please note: AURIN has spatially enabled the original data.

  11. f

    Medians (M) and inter-quartile ranges (IQR) of maximum likelihood parameter...

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan Peters; Stephan Franz Miedl; Christian Büchel (2023). Medians (M) and inter-quartile ranges (IQR) of maximum likelihood parameter estimates for the five discounting models examined (see Table 1 for model equations, numbers and abbreviations). [Dataset]. http://doi.org/10.1371/journal.pone.0047225.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Jan Peters; Stephan Franz Miedl; Christian Büchel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Parameters are shown separately for the three different datasets (1, 2, pathological gamblers [PG]).

  12. n

    Pre-IceBridge LVIS L2 Geolocated Ground Elevation and Return Energy...

    • cmr.earthdata.nasa.gov
    • nsidc.org
    • +4more
    not provided
    Updated Jun 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Pre-IceBridge LVIS L2 Geolocated Ground Elevation and Return Energy Quartiles V001 [Dataset]. http://doi.org/10.5067/BA180DOW1E1Q
    Explore at:
    not providedAvailable download formats
    Dataset updated
    Jun 11, 2025
    Time period covered
    Sep 20, 2007 - Sep 21, 2007
    Area covered
    Description

    This data set contains surface elevation data over Greenland measured by the NASA Land, Vegetation, and Ice Sensor (LVIS), an airborne lidar scanning laser altimeter.

  13. Z

    Data articles in journals

    • data.niaid.nih.gov
    Updated Sep 22, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Loureiro, Vanesa (2023). Data articles in journals [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3753373
    Explore at:
    Dataset updated
    Sep 22, 2023
    Dataset provided by
    Loureiro, Vanesa
    Balsa-Sanchez, Carlota
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Version: 5

    Authors: Carlota Balsa-Sánchez, Vanesa Loureiro

    Date of data collection: 2023/09/05

    General description: The publication of datasets according to the FAIR principles, could be reached publishing a data paper (or software paper) in data journals or in academic standard journals. The excel and CSV file contains a list of academic journals that publish data papers and software papers. File list:

    • data_articles_journal_list_v5.xlsx: full list of 140 academic journals in which data papers or/and software papers could be published
    • data_articles_journal_list_v5.csv: full list of 140 academic journals in which data papers or/and software papers could be published

    Relationship between files: both files have the same information. Two different formats are offered to improve reuse

    Type of version of the dataset: final processed version

    Versions of the files: 5th version - Information updated: number of journals, URL, document types associated to a specific journal.

    Version: 4

    Authors: Carlota Balsa-Sánchez, Vanesa Loureiro

    Date of data collection: 2022/12/15

    General description: The publication of datasets according to the FAIR principles, could be reached publishing a data paper (or software paper) in data journals or in academic standard journals. The excel and CSV file contains a list of academic journals that publish data papers and software papers. File list:

    • data_articles_journal_list_v4.xlsx: full list of 140 academic journals in which data papers or/and software papers could be published
    • data_articles_journal_list_v4.csv: full list of 140 academic journals in which data papers or/and software papers could be published

    Relationship between files: both files have the same information. Two different formats are offered to improve reuse

    Type of version of the dataset: final processed version

    Versions of the files: 4th version - Information updated: number of journals, URL, document types associated to a specific journal, publishers normalization and simplification of document types - Information added : listed in the Directory of Open Access Journals (DOAJ), indexed in Web of Science (WOS) and quartile in Journal Citation Reports (JCR) and/or Scimago Journal and Country Rank (SJR), Scopus and Web of Science (WOS), Journal Master List.

    Version: 3

    Authors: Carlota Balsa-Sánchez, Vanesa Loureiro

    Date of data collection: 2022/10/28

    General description: The publication of datasets according to the FAIR principles, could be reached publishing a data paper (or software paper) in data journals or in academic standard journals. The excel and CSV file contains a list of academic journals that publish data papers and software papers. File list:

    • data_articles_journal_list_v3.xlsx: full list of 124 academic journals in which data papers or/and software papers could be published
    • data_articles_journal_list_3.csv: full list of 124 academic journals in which data papers or/and software papers could be published

    Relationship between files: both files have the same information. Two different formats are offered to improve reuse

    Type of version of the dataset: final processed version

    Versions of the files: 3rd version - Information updated: number of journals, URL, document types associated to a specific journal, publishers normalization and simplification of document types - Information added : listed in the Directory of Open Access Journals (DOAJ), indexed in Web of Science (WOS) and quartile in Journal Citation Reports (JCR) and/or Scimago Journal and Country Rank (SJR).

    Erratum - Data articles in journals Version 3:

    Botanical Studies -- ISSN 1999-3110 -- JCR (JIF) Q2 Data -- ISSN 2306-5729 -- JCR (JIF) n/a Data in Brief -- ISSN 2352-3409 -- JCR (JIF) n/a

    Version: 2

    Author: Francisco Rubio, Universitat Politècnia de València.

    Date of data collection: 2020/06/23

    General description: The publication of datasets according to the FAIR principles, could be reached publishing a data paper (or software paper) in data journals or in academic standard journals. The excel and CSV file contains a list of academic journals that publish data papers and software papers. File list:

    • data_articles_journal_list_v2.xlsx: full list of 56 academic journals in which data papers or/and software papers could be published
    • data_articles_journal_list_v2.csv: full list of 56 academic journals in which data papers or/and software papers could be published

    Relationship between files: both files have the same information. Two different formats are offered to improve reuse

    Type of version of the dataset: final processed version

    Versions of the files: 2nd version - Information updated: number of journals, URL, document types associated to a specific journal, publishers normalization and simplification of document types - Information added : listed in the Directory of Open Access Journals (DOAJ), indexed in Web of Science (WOS) and quartile in Scimago Journal and Country Rank (SJR)

    Total size: 32 KB

    Version 1: Description

    This dataset contains a list of journals that publish data articles, code, software articles and database articles.

    The search strategy in DOAJ and Ulrichsweb was the search for the word data in the title of the journals. Acknowledgements: Xaquín Lores Torres for his invaluable help in preparing this dataset.

  14. Gender, Age, and Emotion Detection from Voice

    • kaggle.com
    Updated May 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohit Zaman (2021). Gender, Age, and Emotion Detection from Voice [Dataset]. https://www.kaggle.com/datasets/rohitzaman/gender-age-and-emotion-detection-from-voice/suggestions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 29, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Rohit Zaman
    Description

    Context

    Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.

    Content

    Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.

    Acknowledgements

    Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/

  15. B

    2016 Census of Canada - Housing Suitability and Shelter-cost-to-income Ratio...

    • borealisdata.ca
    • open.library.ubc.ca
    Updated Apr 9, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statistics Canada (2021). 2016 Census of Canada - Housing Suitability and Shelter-cost-to-income Ratio by Status of Primary Household Maintainer for BC CSDs [custom tabulation] [Dataset]. http://doi.org/10.5683/SP2/6OEKPA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 9, 2021
    Dataset provided by
    Borealis
    Authors
    Statistics Canada
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    British Columbia, Canada
    Description

    This dataset includes one dataset which was custom ordered from Statistics Canada.The table includes information on housing suitability and shelter-cost-to-income ratio by number of bedrooms, housing tenure, status of primary household maintainer, household type, and income quartile ranges for census subdivisions in British Columbia. The dataset is in Beyond 20/20 (.ivt) format. The Beyond 20/20 browser is required in order to open it. This software can be freely downloaded from the Statistics Canada website: https://www.statcan.gc.ca/eng/public/beyond20-20 (Windows only). For information on how to use Beyond 20/20, please see: http://odesi2.scholarsportal.info/documentation/Beyond2020/beyond20-quickstart.pdf https://wiki.ubc.ca/Library:Beyond_20/20_Guide Custom order from Statistics Canada includes the following dimensions and variables: Geography: Non-reserve CSDs in British Columbia - 299 geographies The global non-response rate (GNR) is an important measure of census data quality. It combines total non-response (households) and partial non-response (questions). A lower GNR indicates a lower risk of non-response bias and, as a result, a lower risk of inaccuracy. The counts and estimates for geographic areas with a GNR equal to or greater than 50% are not published in the standard products. The counts and estimates for these areas have a high risk of non-response bias, and in most cases, should not be released. All the geographies requested for this tabulation have been cleared for the release of income data and have a GNR under 50%. Housing Tenure Including Presence of Mortgage (5) 1. Total – Private non-band non-farm off-reserve households with an income greater than zero by housing tenure 2. Households who own 3. With a mortgage1 4. Without a mortgage 5. Households who rent Note: 1) Presence of mortgage - Refers to whether the owner households reported mortgage or loan payments for their dwelling. 2015 Before-tax Household Income Quartile Ranges (5) 1. Total – Private households by quartile ranges1, 2, 3 2. Count of households under or at quartile 1 3. Count of households between quartile 1 and quartile 2 (median) (including at quartile 2) 4. Count of households between quartile 2 (median) and quartile 3 (including at quartile 3) 5. Count of households over quartile 3 Notes: 1) A private household will be assigned to a quartile range depending on its CSD-level location and depending on its tenure (owned and rented). Quartile ranges for owned households in a specific CSD are delimited by the 2015 before-tax income quartiles of owned households with an income greater than zero and residing in non-farm off-reserve dwellings in that CSD. Quartile ranges for rented households in a specific CSD are delimited by the 2015 before-tax income quartiles of rented households with an income greater than zero and residing in non-farm off-reserve dwellings in that CSD. 2) For the income quartiles dollar values (the delimiters) please refer to Table 1. 3) Quartiles 1 to 3 are suppressed if the number of actual records used in the calculation (not rounded or weighted) is less than 16. For cases in which the renters’ quartiles or the owners’ quartiles (figures from Table 1) of a CSD are suppressed the CSD is assigned to a quartile range depending on the provincial renters’ or owners’ quartile figures. Number of Bedrooms (Unit Size) (6) 1. Total – Private households by number of bedrooms1 2. 0 bedrooms (Bachelor/Studio) 3. 1 bedroom 4. 2 bedrooms 5. 3 bedrooms 6. 4 bedrooms Note: 1) Dwellings with 5 bedrooms or more included in the total count only. Housing Suitability (6) 1. Total - Housing suitability 2. Suitable 3. Not suitable 4. One bedroom shortfall 5. Two bedroom shortfall 6. Three or more bedroom shortfall Note: 1) 'Housing suitability' refers to whether a private household is living in suitable accommodations according to the National Occupancy Standard (NOS); that is, whether the dwelling has enough bedrooms for the size and composition of the household. A household is deemed to be living in suitable accommodations if its dwelling has enough bedrooms, as calculated using the NOS. 'Housing suitability' assesses the required number of bedrooms for a household based on the age, sex, and relationships among household members. An alternative variable, 'persons per room,' considers all rooms in a private dwelling and the number of household members. Housing suitability and the National Occupancy Standard (NOS) on which it is based were developed by Canada Mortgage and Housing Corporation (CMHC) through consultations with provincial housing agencies. Shelter-cost-to-income-ratio (4) 1. Total – Private non-band non-farm off-reserve households with an income greater than zero 2. Spending less than 30% of households total income on shelter costs 3. Spending 30% or more of households total income on shelter costs 4. Spending 50% or more of households total income on shelter costs Note: 'Shelter-cost-to-income...

  16. Italy: Mobility COVID-19

    • kaggle.com
    Updated Mar 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mr. Rahman (2021). Italy: Mobility COVID-19 [Dataset]. https://www.kaggle.com/motiurse/italy-mobility-covid19/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 26, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mr. Rahman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Italy
    Description

    A live version of the data record, which will be kept up-to-date with new estimates, can be downloaded from the Humanitarian Data Exchange: https://data.humdata.org/dataset/covid-19-mobility-italy.

    If you find the data helpful or you use the data for your research, please cite our work:

    Pepe, E., Bajardi, P., Gauvin, L., Privitera, F., Lake, B., Cattuto, C., & Tizzoni, M. (2020). COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown. Scientific Data 7, 230 (2020).

    The data record is structured into 4 comma-separated value (CSV) files, as follows:

    id_provinces_IT.csv. Table of the administrative codes of the 107 Italian provinces. The fields of the table are:

    COD_PROV is an integer field that is used to identify a province in all other data records;

    SIGLA is a two-letters code that identifies the province according to the ISO_3166-2 standard (https://en.wikipedia.org/wiki/ISO_3166-2:IT);

    DEN_PCM is the full name of the province.

    OD_Matrix_daily_flows_norm_full_2020_01_18_2020_04_17.csv. The file contains the daily fraction of users’ moving between Italian provinces. Each line corresponds to an entry of matrix (i, j). The fields of the table are:

    p1: COD_PROV of origin,

    p2: COD_PROV of destination,

    day: in the format yyyy-mm-dd.

    median_q1_q3_rog_2020_01_18_2020_04_17.csv. The file contains median and interquartile range (IQR) of users’ radius of gyration in a province by week. Each entry of the table fields of the table are:

    COD_PROV of the province;

    SIGLA of the province;

    DEN_PCM of the province;

    week: median value of the radius of gyration on week week, with week in the format dd/mm-DD/MM where dd/mm and DD/MM are the first and the last day of the week, respectively.

    week Q1 first quartile (Q1) of the distribution of the radius of gyration on week week,

    week Q3 third quartile (Q3) of the distribution of the radius of gyration on week week,

    average_network_degree_2020_01_18_2020_04_17.csv. The file contains daily time-series of the average degree 〈k〉 of the proximity network. Each entry of the table is a value of 〈k〉 on a given day. The fields of the table are:

    COD_PROV of the province;

    SIGLA of the province;

    DEN_PCM of the province;

    day in the format yyyy-mm-dd.

    ESRI shapefiles of the Italian provinces updated to the most recent definition are available from the website of the Italian National Office of Statistics (ISTAT): https://www.istat.it/it/archivio/222527.

  17. DOAC Reanalysis Dataset

    • zenodo.org
    bin
    Updated Oct 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kim Boesen; Luis Carlos Saiz; Peter C Gøtzsche; Juan Erviti; Kim Boesen; Luis Carlos Saiz; Peter C Gøtzsche; Juan Erviti (2024). DOAC Reanalysis Dataset [Dataset]. http://doi.org/10.5281/zenodo.13960575
    Explore at:
    binAvailable download formats
    Dataset updated
    Oct 21, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Kim Boesen; Luis Carlos Saiz; Peter C Gøtzsche; Juan Erviti; Kim Boesen; Luis Carlos Saiz; Peter C Gøtzsche; Juan Erviti
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Welcome to the direct oral anticoagulant (DOAC) Reanalysis Dataset.

    Sheet 1: Exact references to the FDA reviews from which we extracted all data points. You will also find links to the FDA drug approval packages, where one also finds all other published documents pertaining to the approvals, such as statistical reviews. In Sheet 1, we also cite the primary trial reports for each of the four pivotal DOAC trials.

    Sheet 2: Basic overview of the 4 pivotal DOAC trials with an emphasis on time in therapeutic range (TTR) characteristics.

    ISheet 3: Summary results from each of the 4 DOAC trials for the outcomes of stroke/systemic embolism, major bleed, and mortality (including outcome definitions from each trial).

    Sheet 4: The full TTR dataset with outcomes stratified into quartiles (Q1 to Q4), including exact references to each data point in the FDA reviews.

    Sheet 5: Q4 thresholds and conclusions in the industry TTR analyses.

  18. a

    ABS ASGS Ed3 SA2 2021 Index of Household Advantage and Disadvantage 2021

    • digital.atlas.gov.au
    Updated Feb 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Digital Atlas of Australia (2025). ABS ASGS Ed3 SA2 2021 Index of Household Advantage and Disadvantage 2021 [Dataset]. https://digital.atlas.gov.au/items/9e69519ea62c4e1eb1224d0b613722ab
    Explore at:
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Digital Atlas of Australia
    Area covered
    Description

    The Index of Household Advantage and Disadvantage (IHAD) provides a summary measure of relative socio-economic advantage and disadvantage for households, based on the characteristics of dwellings and the people living within them, using 2021 Census data.

    All in-scope households are ordered from lowest to highest score. A low score indicates relatively greater disadvantage and a lack of advantage in general. A high score indicates a relative lack of disadvantage and greater advantage in general.

    This dataset presents IHAD data in quartiles. The lowest 25% of households are given a quartile number of 1, the next lowest 25% of households are given a quartile number of 2 and so on, up to the highest 25% of households which are given a quartile number of 4. This means that households are divided into four equal sized groups, depending on their score. In practice these groups won’t each be exactly 25% of households as it depends on the distribution of the IHAD scores. The data is grouped by Statistical Area Level 2 (SA2 2021). SA2s are defined by the Australian Statistical Geography Standard (ASGS) Edition 3.

    Key Attributes:

          Field alias
          Field name
          Description
    
    
          Statistical Areas Level 2 2021 code
          SA2_CODE_2021
          2021 Statistical Areas Level 2 (SA2) codes from the Australian Statistical Geography Standard (ASGS), Edition 3. SA2s are medium-sized general purpose areas built to represent communities that interact together socially and economically. 
    
    
            Statistical Areas Level 2 2021 name
          SA2_NAME_2021
          2021 Statistical Areas Level 2 (SA2) names from the Australian Statistical Geography Standard (ASGS), Edition 3. SA2s are medium-sized general purpose areas built to represent communities that interact together socially and economically. 
    
    
          Area in square kilometres
          AREA_ALBERS_SQKM
          The area of a region in square kilometres, based on the Albers equal area conic projection.
    
    
          Uniform Resource Identifier
          ASGS_LOCI_URI_2021
          A uniform resource identifier can be used in web linked applications for data integration. 
    
    
          IHAD quartile 1
          IHAD_QUARTILE1
          Proportion of in-scope dwellings in the SA2 that fall into IHAD quartile 1, indicating relatively greater disadvantage and a lack of advantage in general.
    
    
          IHAD quartile 2
          IHAD_QUARTILE2
          Proportion of in-scope dwellings in the SA2 that fall into IHAD quartile 2.
    
    
          IHAD quartile 3
          IHAD_QUARTILE3
          Proportion of in-scope dwellings in the SA2 that fall into IHAD quartile 3.
    
    
          IHAD quartile 4
          IHAD_QUARTILE4
          Proportion of in-scope dwellings in the SA2 that fall into IHAD quartile 4, indicating a relative lack of disadvantage and greater advantage in general.
    
    
          Occupied private dwellings
          OPD_2021
          Dwellings in-scope of the IHAD i.e. classifiable occupied private dwellings.
    
    
          SEIFA IRSAD quartile
          IRSAD_QUARTILE
          Index of Relative Socio-economic Advantage and Disadvantage quartile. All SA2s are ordered from lowest to highest score, the lowest 25% of SA2s are given a quartile number of 1, the next lowest 25% of SA2s are given a quartile number of 2 and so on, up to the highest 25% of SA2s which are given a quartile number of 4. This means that SA2s are divided into four equal sized groups, depending on their score. In practice these groups won’t each be exactly 25% of SA2s as it depends on the distribution of SEIFA scores.
    
    
          Usual resident population
          URP_2021
          Population counts in this column are based on place of usual residence as reported on Census Night. These include persons out of scope of the IHAD.
    
    
          Dwellings
          DWELLING
          Total dwellings at Census time, including dwellings out of scope of the IHAD e.g. unoccupied private dwellings.
    

    Please note: Proportional totals may equal more than 100% due to rounding and random adjustments made to the data. When calculating proportions, percentages, or ratios from cross-classified or small area tables, the random error introduced can be ignored except when very small cells are involved, in which case the impact on percentages and ratios can be significant. Refer to the Introduced random error / perturbation Census page on the ABS website for more information.

    Data and geography references

    Source data publication: Index of Household Advantage and Disadvantage Geographic boundary information: Australian Statistical Geography Standard (ASGS) Edition 3 Further information: Index of Household Advantage and Disadvantage methodology, 2021 Source: Australian Bureau of Statistics (ABS)

    Contact the Australian Bureau of Statistics

    Email geography@abs.gov.au if you have any questions or feedback about this web service.
    Subscribe to get updates on ABS web services and geospatial products.
    

    Privacy at the Australian Bureau of Statistics Read how the ABS manages personal information - ABS privacy policy.

  19. d

    NODC Standard Product: International Ocean Atlas Volume 4 - Atlas of...

    • catalog.data.gov
    • search.dataone.org
    Updated Jul 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact) (2025). NODC Standard Product: International Ocean Atlas Volume 4 - Atlas of temperature / salinity frequency distributions (2 disc set) (NCEI Accession 0101473) [Dataset]. https://catalog.data.gov/dataset/nodc-standard-product-international-ocean-atlas-volume-4-atlas-of-temperature-salinity-frequenc1
    Explore at:
    Dataset updated
    Jul 1, 2025
    Dataset provided by
    (Point of Contact)
    Description

    This Atlas presents more than 80,000 plots of the empirical frequency distributions of temperature and salinity for each 5-degree square area of the North Atlantic Ocean (80N to 30S) at all standard depth levels based on World Ocean Database 1998 data. Additional empirical statistical plots include the mean and standard deviation based on the arithmetic mean, the median and Median Absolute Deviation (MAD), winsorized estimates of the mean and standard deviation, quartiles, and skewness estimated from the quartiles. Some of these statistics are presented in both "normalized" and "natural" coordinates. Disc 1 contains seasonal distributions for the upper (0 m to 400 m) ocean. Disc 2 contains annual distributions for the deep (500 m - 5500 m) ocean.

  20. g

    Gender Pay Gaps in London | gimi9.com

    • gimi9.com
    Updated Jun 14, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Gender Pay Gaps in London | gimi9.com [Dataset]. https://gimi9.com/dataset/london_gender-pay-gaps
    Explore at:
    Dataset updated
    Jun 14, 2024
    Area covered
    London
    Description

    This dataset contains gender pay gap figures for all employees in London and large employers in London. The pay gap figures for GLA group organisations can be found on their respective websites. The gender pay gap is the difference in the average hourly wage of all men and women across a workforce. If women do more of the less well paid jobs within an organisation than men, the gender pay gap is usually bigger. The UK government publish gender pay gap figures for all employers with 250 or more employees. A cut of this dataset that only shows employers that are registered in London can be found below. Read a report by the Local Government Association (LGA) that summarises the mean and median pay gaps in local authorities, as well as the distribution of staff across pay quartiles. This dataset is one of the Greater London Authority's measures of Economic Fairness. Click here to find out more. This dataset is one of the Greater London Authority's measures of Economic Development strategy. Click here to find out more.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Heike Hofmann; Hadley Wickham; Karen Kafadar (2023). Letter-Value Plots: Boxplots for Large Data [Dataset]. http://doi.org/10.6084/m9.figshare.4748227.v3

Letter-Value Plots: Boxplots for Large Data

Related Article
Explore at:
zipAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
Taylor & Francis
Authors
Heike Hofmann; Hadley Wickham; Karen Kafadar
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Boxplots are useful displays that convey rough information about the distribution of a variable. Boxplots were designed to be drawn by hand and work best for small datasets, where detailed estimates of tail behavior beyond the quartiles may not be trustworthy. Larger datasets afford more precise estimates of tail behavior, but boxplots do not take advantage of this precision, instead presenting large numbers of extreme, though not unexpected, observations. Letter-value plots address this problem by including more detailed information about the tails using “letter values,” an order statistic defined by Tukey. Boxplots display the first two letter values (the median and quartiles); letter-value plots display further letter values so far as they are reliable estimates of their corresponding quantiles. We illustrate letter-value plots with real data that demonstrate their usefulness for large datasets. All graphics are created using the R package lvplot, and code and data are available in the supplementary materials.

Search
Clear search
Close search
Google apps
Main menu