70 datasets found
  1. f

    THINGS-data: MEG BIDS raw dataset

    • plus.figshare.com
    bin
    Updated Jun 1, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Hebart; Oliver Contier; Lina Teichmann; Adam Rockter; Charles Zheng; Alexis Kidder; Anna Corriveau; Maryam Vaziri-Pashkam; Chris Baker (2023). THINGS-data: MEG BIDS raw dataset [Dataset]. http://doi.org/10.25452/figshare.plus.20563800.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figshare+
    Authors
    Martin Hebart; Oliver Contier; Lina Teichmann; Adam Rockter; Charles Zheng; Alexis Kidder; Anna Corriveau; Maryam Vaziri-Pashkam; Chris Baker
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    MEG raw dataset in BIDS format.

    Part of THINGS-data: A multimodal collection of large-scale datasets for investigating object representations in brain and behavior.

    See related materials in Collection at: https://doi.org/10.25452/figshare.plus.c.6161151

  2. R

    Data from: Prohibited Items Dataset

    • universe.roboflow.com
    zip
    Updated Jun 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    aezakmi (2025). Prohibited Items Dataset [Dataset]. https://universe.roboflow.com/aezakmi-rdxkf/prohibited-items-tuts5/model/2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 4, 2025
    Dataset authored and provided by
    aezakmi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Objects Bounding Boxes
    Description

    Prohibited Items

    ## Overview
    
    Prohibited Items is a dataset for object detection tasks - it contains Objects annotations for 810 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  3. f

    THINGS-data: fMRI Single Trial Responses (nifti format)

    • plus.figshare.com
    bin
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Hebart; Oliver Contier; Lina Teichmann; Adam Rockter; Charles Zheng; Alexis Kidder; Anna Corriveau; Maryam Vaziri-Pashkam; Chris Baker (2023). THINGS-data: fMRI Single Trial Responses (nifti format) [Dataset]. http://doi.org/10.25452/figshare.plus.20590140.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Figshare+
    Authors
    Martin Hebart; Oliver Contier; Lina Teichmann; Adam Rockter; Charles Zheng; Alexis Kidder; Anna Corriveau; Maryam Vaziri-Pashkam; Chris Baker
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Whole-brain single trial beta estimates of the THINGS-fMRI data.

    Part of THINGS-data: A multimodal collection of large-scale datasets for investigating object representations in brain and behavior.

    See related materials in Collection at: https://doi.org/10.25452/figshare.plus.c.6161151

  4. LCDS Data

    • kaggle.com
    zip
    Updated Aug 11, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ML Coder (2017). LCDS Data [Dataset]. https://www.kaggle.com/rsaiml/lcds-data
    Explore at:
    zip(4607300 bytes)Available download formats
    Dataset updated
    Aug 11, 2017
    Authors
    ML Coder
    Description

    Context

    There's a story behind every dataset and here's your opportunity to share yours.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  5. Data from: Prohibited Items

    • catalog.data.gov
    • data.wu.ac.at
    Updated Jan 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TSA.GOV Public Website (2024). Prohibited Items [Dataset]. https://catalog.data.gov/dataset/prohibited-items-17d05
    Explore at:
    Dataset updated
    Jan 1, 2024
    Dataset provided by
    Transportation Security Administrationhttp://tsa.gov/
    Description

    Airport prohibited items detected overlay with geography or time hierarchy

  6. d

    Array of Things Locations

    • catalog.data.gov
    • data.cityofchicago.org
    • +4more
    Updated Dec 16, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.cityofchicago.org (2023). Array of Things Locations [Dataset]. https://catalog.data.gov/dataset/array-of-things-locations
    Explore at:
    Dataset updated
    Dec 16, 2023
    Dataset provided by
    data.cityofchicago.org
    Description

    Locations of Array of Things sensor nodes. For more information on the Array of Things project, see https://arrayofthings.github.io.

  7. SMTX Arrest Data

    • kaggle.com
    zip
    Updated Jan 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jehnny Oh (2021). SMTX Arrest Data [Dataset]. https://www.kaggle.com/motherofdata/smpd-data
    Explore at:
    zip(6752907 bytes)Available download formats
    Dataset updated
    Jan 27, 2021
    Authors
    Jehnny Oh
    Description

    see README.md for more information on this project.

    ****UPDATE 7/10/20**** daniel used his formula magic to offset and match the data into one file: arrest list - total. this file contians 2017, 2018, 2019 arrests with demographic data for SMPD.

    ****UPDATE 7/9/20**** The existing files are presented here in their original raw format as received from SMPD. I am presently working on formatting & scrubbing them.

  8. o

    Data from: 3DCoMPaT: Composition of Materials on Parts of 3D Things

    • registry.opendata.aws
    Updated Jul 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vision-CAIR, CEMSE, KAUST (2022). 3DCoMPaT: Composition of Materials on Parts of 3D Things [Dataset]. https://registry.opendata.aws/3dcompat/
    Explore at:
    Dataset updated
    Jul 29, 2022
    Dataset provided by
    <a href="https://cemse.kaust.edu.sa/vision-cair">Vision-CAIR, CEMSE, KAUST</a>
    Description

    3D CoMPaT is a richly annotated large-scale dataset of rendered compositions of Materials on Parts of thousands of unique 3D Models. This dataset primarily focuses on stylizing 3D shapes at part-level with compatible materials. Each object with the applied part-material compositions is rendered from four equally spaced views as well as four randomized views. We introduce a new task, called Grounded CoMPaT Recognition (GCR), to collectively recognize and ground compositions of materials on parts of 3D objects. We present two variations of this task and adapt state-of-art 2D/3D deep learning methods to solve the problem as baselines for future research. We hope our work will help ease future research on compositional 3D Vision.

  9. f

    THINGS-data: fMRI cortical surface flat maps

    • plus.figshare.com
    zip
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Hebart; Oliver Contier; Lina Teichmann; Adam Rockter; Charles Zheng; Alexis Kidder; Anna Corriveau; Maryam Vaziri-Pashkam; Chris Baker (2023). THINGS-data: fMRI cortical surface flat maps [Dataset]. http://doi.org/10.25452/figshare.plus.20496702.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figshare+
    Authors
    Martin Hebart; Oliver Contier; Lina Teichmann; Adam Rockter; Charles Zheng; Alexis Kidder; Anna Corriveau; Maryam Vaziri-Pashkam; Chris Baker
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Cortical flat maps for three subjects derived from the anatomical MRI images. Cortical surfaces were reconstructed from T1-weighted and T2-weighted anatomical images with freesurfer's reconall procedure. Relaxation cuts were placed manually to allow for flattening of each hemisphere's surface. Results of any analysis of the fMRI data can be viewed on these flat maps with pycortex.

    Part of THINGS-data: A multimodal collection of large-scale datasets for investigating object representations in brain and behavior

    See related materials in Collection at: https://doi.org/10.25452/figshare.plus.c.6161151

  10. Seair Exim Solutions

    • seair.co.in
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim, Seair Exim Solutions [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    India
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  11. The Items Dataset

    • zenodo.org
    Updated Nov 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patrick Egan; Patrick Egan (2024). The Items Dataset [Dataset]. http://doi.org/10.5281/zenodo.10964134
    Explore at:
    Dataset updated
    Nov 13, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Patrick Egan; Patrick Egan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset originally created 03/01/2019 UPDATE: Packaged on 04/18/2019 UPDATE: Edited README on 04/18/2019

    I. About this Data Set This data set is a snapshot of work that is ongoing as a collaboration between Kluge Fellow in Digital Studies, Patrick Egan and an intern at the Library of Congress in the American Folklife Center. It contains a combination of metadata from various collections that contain audio recordings of Irish traditional music. The development of this dataset is iterative, and it integrates visualizations that follow the key principles of trust and approachability. The project, entitled, “Connections In Sound” invites you to use and re-use this data.

    The text available in the Items dataset is generated from multiple collections of audio material that were discovered at the American Folklife Center. Each instance of a performance was listed and “sets” or medleys of tunes or songs were split into distinct instances in order to allow machines to read each title separately (whilst still noting that they were part of a group of tunes). The work of the intern was then reviewed before publication, and cross-referenced with the tune index at www.irishtune.info. The Items dataset consists of just over 1000 rows, with new data being added daily in a separate file.

    The collections dataset contains at least 37 rows of collections that were located by a reference librarian at the American Folklife Center. This search was complemented by searches of the collections by the scholar both on the internet at https://catalog.loc.gov and by using card catalogs.

    Updates to these datasets will be announced and published as the project progresses.

    II. What’s included? This data set includes:

    • The Items Dataset – a .CSV containing Media Note, OriginalFormat, On Website, Collection Ref, Missing In Duplication, Collection, Outside Link, Performer, Solo/multiple, Sub-item, type of tune, Tune, Position, Location, State, Date, Notes/Composer, Potential Linked Data, Instrument, Additional Notes, Tune Cleanup. This .CSV is the direct export of the Items Google Spreadsheet

    III. How Was It Created? These data were created by a Kluge Fellow in Digital Studies and an intern on this program over the course of three months. By listening, transcribing, reviewing, and tagging audio recordings, these scholars improve access and connect sounds in the American Folklife Collections by focusing on Irish traditional music. Once transcribed and tagged, information in these datasets is reviewed before publication.

    IV. Data Set Field Descriptions

    IV

    a) Collections dataset field descriptions

    • ItemId – this is the identifier for the collection that was found at the AFC
    • Viewed – if the collection has been viewed, or accessed in any way by the researchers.
    • On LOC – whether or not there are audio recordings of this collection available on the Library of Congress website.
    • On Other Website – if any of the recordings in this collection are available elsewhere on the internet
    • Original Format – the format that was used during the creation of the recordings that were found within each collection
    • Search – this indicates the type of search that was performed in order that resulted in locating recordings and collections within the AFC
    • Collection – the official title for the collection as noted on the Library of Congress website
    • State – The primary state where recordings from the collection were located
    • Other States – The secondary states where recordings from the collection were located
    • Era / Date – The decade or year associated with each collection
    • Call Number – This is the official reference number that is used to locate the collections, both in the urls used on the Library website, and in the reference search for catalog cards (catalog cards can be searched at this address: https://memory.loc.gov/diglib/ihas/html/afccards/afccards-home.html)
    • Finding Aid Online? – Whether or not a finding aid is available for this collection on the internet

    b) Items dataset field descriptions

    • id – the specific identification of the instance of a tune, song or dance within the dataset
    • Media Note – Any information that is included with the original format, such as identification, name of physical item, additional metadata written on the physical item
    • Original Format – The physical format that was used when recording each specific performance. Note: this field is used in order to calculate the number of physical items that were created in each collection such as 32 wax cylinders.
    • On Webste? – Whether or not each instance of a performance is available on the Library of Congress website
    • Collection Ref – The official reference number of the collection
    • Missing In Duplication – This column marks if parts of some recordings had been made available on other websites, but not all of the recordings were included in duplication (see recordings from Philadelphia Céilí Group on Villanova University website)
    • Collection – The official title of the collection given by the American Folklife Center
    • Outside Link – If recordings are available on other websites externally
    • Performer – The name of the contributor(s)
    • Solo/multiple – This field is used to calculate the amount of solo performers vs group performers in each collection
    • Sub-item – In some cases, physical recordings contained extra details, the sub-item column was used to denote these details
    • Type of item – This column describes each individual item type, as noted by performers and collectors
    • Item – The item title, as noted by performers and collectors. If an item was not described, it was entered as “unidentified”
    • Position – The position on the recording (in some cases during playback, audio cassette player counter markers were used)
    • Location – Local address of the recording
    • State – The state where the recording was made
    • Date – The date that the recording was made
    • Notes/Composer – The stated composer or source of the item recorded
    • Potential Linked Data – If items may be linked to other recordings or data, this column was used to provide examples of potential relationships between them
    • Instrument – The instrument(s) that was used during the performance
    • Additional Notes – Notes about the process of capturing, transcribing and tagging recordings (for researcher and intern collaboration purposes)
    • Tune Cleanup – This column was used to tidy each item so that it could be read by machines, but also so that spelling mistakes from the Item column could be corrected, and as an aid to preserving iterations of the editing process

    V. Rights statement The text in this data set was created by the researcher and intern and can be used in many different ways under creative commons with attribution. All contributions to Connections In Sound are released into the public domain as they are created. Anyone is free to use and re-use this data set in any way they want, provided reference is given to the creators of these datasets.

    VI. Creator and Contributor Information

    Creator: Connections In Sound

    Contributors: Library of Congress Labs

    VII. Contact Information Please direct all questions and comments to Patrick Egan via www.twitter.com/drpatrickegan or via his website at www.patrickegan.org. You can also get in touch with the Library of Congress Labs team via LC-Labs@loc.gov.

  12. Seair Exim Solutions

    • seair.co.in
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim, Seair Exim Solutions [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    United States
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  13. p

    Data from: CLIP: A Dataset for Extracting Action Items for Physicians from...

    • physionet.org
    Updated Jun 21, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    James Mullenbach; Yada Pruksachatkun; Sean Adler; Jennifer Seale; Jordan Swartz; T Greg McKelvey; Yi Yang; David Sontag (2021). CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes [Dataset]. http://doi.org/10.13026/kw00-z903
    Explore at:
    Dataset updated
    Jun 21, 2021
    Authors
    James Mullenbach; Yada Pruksachatkun; Sean Adler; Jennifer Seale; Jordan Swartz; T Greg McKelvey; Yi Yang; David Sontag
    License

    https://github.com/MIT-LCP/license-and-dua/tree/master/draftshttps://github.com/MIT-LCP/license-and-dua/tree/master/drafts

    Description

    We created a dataset of clinical action items annotated over MIMIC-III. This dataset, which we call CLIP, is annotated by physicians and covers 718 discharge summaries, representing 107,494 sentences. Annotations were collected as character-level spans to discharge summaries after applying surrogate generation to fill in the anonymized templates from MIMIC-III text with faked data. We release these spans, their aggregation into sentence-level labels, and the sentence tokenizer used to aggregate the spans and label sentences. We also release the surrogate data generator, and the document IDs used for training, validation, and test splits, to enable reproduction. The spans are annotated with 0 or more labels of 7 different types, representing the different actions that may need to be taken: Appointment, Lab, Procedure, Medication, Imaging, Patient Instructions, and Other. We encourage the community to use this dataset to develop methods for automatically extracting clinical action items from discharge summaries.

  14. Seair Exim Solutions

    • seair.co.in
    Updated Mar 12, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim (2017). Seair Exim Solutions [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset updated
    Mar 12, 2017
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    United States
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  15. d

    LIB - Items borrowed through holds system

    • catalog.data.gov
    • data.montgomerycountymd.gov
    Updated May 17, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.montgomerycountymd.gov (2025). LIB - Items borrowed through holds system [Dataset]. https://catalog.data.gov/dataset/lib-items-borrowed-through-holds-system
    Explore at:
    Dataset updated
    May 17, 2025
    Dataset provided by
    data.montgomerycountymd.gov
    Description

    This dataset contains data for the total number of items processed for customer requests to 'holds' at each branch library. Update Frequency : Annually

  16. i

    False Data Injection Attack Dataset for Industrial Internet of Things

    • ieee-dataport.org
    Updated Nov 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AKM Ahasan Habib (2024). False Data Injection Attack Dataset for Industrial Internet of Things [Dataset]. https://ieee-dataport.org/documents/false-data-injection-attack-dataset-industrial-internet-things
    Explore at:
    Dataset updated
    Nov 29, 2024
    Authors
    AKM Ahasan Habib
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    operating system

  17. UKCES spend transaction data: items greater than £500

    • gov.uk
    Updated Feb 1, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UK Commission for Employment and Skills (2014). UKCES spend transaction data: items greater than £500 [Dataset]. https://www.gov.uk/government/publications/ukces-spend-transaction-data-items-greater-than-500
    Explore at:
    Dataset updated
    Feb 1, 2014
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    UK Commission for Employment and Skills
    Description

    This financial information is released monthly and is available from October 2011 onwards.

  18. Kitchen Stuff Plus Inc Importer/Buyer Data in USA, Kitchen Stuff Plus Inc...

    • seair.co.in
    Updated Feb 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim (2025). Kitchen Stuff Plus Inc Importer/Buyer Data in USA, Kitchen Stuff Plus Inc Imports Data [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset updated
    Feb 21, 2025
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    United States
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  19. Wedding gift items dataset

    • kaggle.com
    Updated May 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kanchana1990 (2024). Wedding gift items dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/8453309
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 18, 2024
    Dataset provided by
    Kaggle
    Authors
    Kanchana1990
    License

    Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    Dataset Overview

    This dataset contains detailed information on various wedding gifts listed on Etsy. Each entry includes the product name, HTML-formatted description, price, listing date, and the number of favorites. With 748 rows and 5 columns, this dataset provides a comprehensive look at the wedding gift offerings on Etsy as of May 2024.

    Data Science Applications

    This dataset is ideal for a variety of data science applications, including: - Exploratory Data Analysis (EDA): Understand trends in wedding gift preferences and popularity. - Natural Language Processing (NLP): Analyze product descriptions for sentiment, key phrases, and themes. - Pricing Strategy Analysis: Study the distribution of product prices and factors influencing pricing. - Recommendation Systems: Develop models to suggest popular or highly favorited products.

    Column Descriptors

    1. name: The name of the product.
    2. descriptionHTML: The HTML-formatted description of the product, which may include links to the product pages.
    3. Price: The price of the product, listed in various currencies.
    4. listedOn: The date the product was listed.
    5. favorites: The number of times the product has been favorited by users.

    Ethically Mined Data

    The data in this dataset was collected from publicly available listings on Etsy and compiled for educational and analytical purposes. No personally identifiable information (PII) is included, ensuring the dataset adheres to ethical data collection practices.

    Acknowledgements

    Special thanks to Etsy for providing a platform where this data could be collected and shared for educational and analytical purposes. Also for DALL E3 for the thumbnail image.

  20. Z

    UCI and OpenML Data Sets for Ordinal Quantification

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Moreo, Alejandro (2023). UCI and OpenML Data Sets for Ordinal Quantification [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_8177301
    Explore at:
    Dataset updated
    Jul 25, 2023
    Dataset provided by
    Bunse, Mirko
    Moreo, Alejandro
    Sebastiani, Fabrizio
    Senz, Martin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These four labeled data sets are targeted at ordinal quantification. The goal of quantification is not to predict the label of each individual instance, but the distribution of labels in unlabeled sets of data.

    With the scripts provided, you can extract CSV files from the UCI machine learning repository and from OpenML. The ordinal class labels stem from a binning of a continuous regression label.

    We complement this data set with the indices of data items that appear in each sample of our evaluation. Hence, you can precisely replicate our samples by drawing the specified data items. The indices stem from two evaluation protocols that are well suited for ordinal quantification. To this end, each row in the files app_val_indices.csv, app_tst_indices.csv, app-oq_val_indices.csv, and app-oq_tst_indices.csv represents one sample.

    Our first protocol is the artificial prevalence protocol (APP), where all possible distributions of labels are drawn with an equal probability. The second protocol, APP-OQ, is a variant thereof, where only the smoothest 20% of all APP samples are considered. This variant is targeted at ordinal quantification tasks, where classes are ordered and a similarity of neighboring classes can be assumed.

    Usage

    You can extract four CSV files through the provided script extract-oq.jl, which is conveniently wrapped in a Makefile. The Project.toml and Manifest.toml specify the Julia package dependencies, similar to a requirements file in Python.

    Preliminaries: You have to have a working Julia installation. We have used Julia v1.6.5 in our experiments.

    Data Extraction: In your terminal, you can call either

    make

    (recommended), or

    julia --project="." --eval "using Pkg; Pkg.instantiate()" julia --project="." extract-oq.jl

    Outcome: The first row in each CSV file is the header. The first column, named "class_label", is the ordinal class.

    Further Reading

    Implementation of our experiments: https://github.com/mirkobunse/regularized-oq

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Martin Hebart; Oliver Contier; Lina Teichmann; Adam Rockter; Charles Zheng; Alexis Kidder; Anna Corriveau; Maryam Vaziri-Pashkam; Chris Baker (2023). THINGS-data: MEG BIDS raw dataset [Dataset]. http://doi.org/10.25452/figshare.plus.20563800.v1

THINGS-data: MEG BIDS raw dataset

Related Article
Explore at:
binAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
Figshare+
Authors
Martin Hebart; Oliver Contier; Lina Teichmann; Adam Rockter; Charles Zheng; Alexis Kidder; Anna Corriveau; Maryam Vaziri-Pashkam; Chris Baker
License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Description

MEG raw dataset in BIDS format.

Part of THINGS-data: A multimodal collection of large-scale datasets for investigating object representations in brain and behavior.

See related materials in Collection at: https://doi.org/10.25452/figshare.plus.c.6161151

Search
Clear search
Close search
Google apps
Main menu