4 datasets found
  1. O

    MUSIED

    • opendatalab.com
    zip
    Updated Feb 5, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peking University (2023). MUSIED [Dataset]. https://opendatalab.com/OpenDataLab/MUSIED
    Explore at:
    zip(35559885 bytes)Available download formats
    Dataset updated
    Feb 5, 2023
    Dataset provided by
    Peking University
    License

    https://choosealicense.com/licenses/agpl-3.0/https://choosealicense.com/licenses/agpl-3.0/

    Description

    We propose a new largescale Chinese event detection dataset based on user reviews, text conversations, and phone conversations in a leading e-commerce platform for food service.

  2. m

    General conversation speech datasets in English for Wedding Planning

    • data.macgence.com
    mp3
    Updated Jul 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Macgence (2024). General conversation speech datasets in English for Wedding Planning [Dataset]. https://data.macgence.com/dataset/general-conversation-speech-datasets-in-english-for-wedding-planning
    Explore at:
    mp3Available download formats
    Dataset updated
    Jul 31, 2024
    Dataset authored and provided by
    Macgence
    License

    https://data.macgence.com/terms-and-conditionshttps://data.macgence.com/terms-and-conditions

    Time period covered
    2025
    Area covered
    Worldwide
    Variables measured
    Outcome, Call Type, Transcriptions, Audio Recordings, Speaker Metadata, Conversation Topics
    Description

    Plan memorable weddings with Macgence's English conversation dataset. Ideal for AI, linguistics, and event tech, ensuring relevance and impactful insights!

  3. Data from: PHEME dataset of rumours and non-rumours

    • figshare.com
    bz2
    Updated May 30, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arkaitz Zubiaga; Geraldine Wong Sak Hoi; Maria Liakata; Rob Procter (2023). PHEME dataset of rumours and non-rumours [Dataset]. http://doi.org/10.6084/m9.figshare.4010619.v1
    Explore at:
    bz2Available download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Authors
    Arkaitz Zubiaga; Geraldine Wong Sak Hoi; Maria Liakata; Rob Procter
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains a collection of Twitter rumours and non-rumours posted during breaking news. The five breaking news provided with the dataset are as follows:* Charlie Hebdo: 458 rumours (22.0%) and 1,621 non-rumours (78.0%).* Ferguson: 284 rumours (24.8%) and 859 non-rumours (75.2%).* Germanwings Crash: 238 rumours (50.7%) and 231 non-rumours (49.3%).* Ottawa Shooting: 470 rumours (52.8%) and 420 non-rumours (47.2%).* Sydney Siege: 522 rumours (42.8%) and 699 non-rumours (57.2%).The data is structured as follows. Each event has a directory, with two subfolders, rumours and non-rumours. These two folders have folders named with a tweet ID. The tweet itself can be found on the 'source-tweet' directory of the tweet in question, and the directory 'reactions' has the set of tweets responding to that source tweet.This dataset was used in the paper 'Learning Reporting Dynamics during Breaking News for Rumour Detection in Social Media' for rumour detection. For more details, please refer to the paper.License: The annotations are provided under a CC-BY license, while Twitter retains the ownership and rights of the content of the tweets.

  4. PHEME dataset for Rumour Detection and Veracity Classification

    • figshare.com
    application/gzip
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elena Kochkina; Maria Liakata; Arkaitz Zubiaga (2023). PHEME dataset for Rumour Detection and Veracity Classification [Dataset]. http://doi.org/10.6084/m9.figshare.6392078.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Elena Kochkina; Maria Liakata; Arkaitz Zubiaga
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains a collection of Twitter rumours and non-rumours posted during breaking news.

    The data is structured as follows. Each event has a directory, with two subfolders, rumours and non-rumours. These two folders have folders named with a tweet ID. The tweet itself can be found on the 'source-tweet' directory of the tweet in question, and the directory 'reactions' has the set of tweets responding to that source tweet. Also each folder contains ‘annotation.json’ which contains information about veracity of the rumour and ‘structure.json’, which contains information about structure of the conversation.

    This dataset is an extension of the PHEME dataset of rumours and non-rumours (https://figshare.com/articles/PHEME_dataset_of_rumours_and_non-rumours/4010619), it contains rumours related to 9 events and each of the rumours is annotated with its veracity value, either True, False or Unverified.

    This dataset was used in the paper 'All-in-one: Multi-task Learning for Rumour Verification'. For more details, please refer to the paper.

    Code using this dataset can be found on github (https://github.com/kochkinaelena/Multitask4Veracity).

    License: The annotations are provided under a CC-BY license, while Twitter retains the ownership and rights of the content of the tweets.

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Peking University (2023). MUSIED [Dataset]. https://opendatalab.com/OpenDataLab/MUSIED

MUSIED

OpenDataLab/MUSIED

Explore at:
zip(35559885 bytes)Available download formats
Dataset updated
Feb 5, 2023
Dataset provided by
Peking University
License

https://choosealicense.com/licenses/agpl-3.0/https://choosealicense.com/licenses/agpl-3.0/

Description

We propose a new largescale Chinese event detection dataset based on user reviews, text conversations, and phone conversations in a leading e-commerce platform for food service.

Search
Clear search
Close search
Google apps
Main menu