4 datasets found

O
MUSIED
opendatalab.com
zip
Updated Feb 5, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Peking University (2023). MUSIED [Dataset]. https://opendatalab.com/OpenDataLab/MUSIED
Explore at:
zip(35559885 bytes)Available download formats
Dataset updated
Feb 5, 2023
Dataset provided by
Peking University
License
https://choosealicense.com/licenses/agpl-3.0/https://choosealicense.com/licenses/agpl-3.0/
Description
We propose a new largescale Chinese event detection dataset based on user reviews, text conversations, and phone conversations in a leading e-commerce platform for food service.
m
General conversation speech datasets in English for Wedding Planning
data.macgence.com
mp3
Updated Jul 31, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Macgence (2024). General conversation speech datasets in English for Wedding Planning [Dataset]. https://data.macgence.com/dataset/general-conversation-speech-datasets-in-english-for-wedding-planning
Explore at:
mp3Available download formats
Dataset updated
Jul 31, 2024
Dataset authored and provided by
Macgence
License
https://data.macgence.com/terms-and-conditionshttps://data.macgence.com/terms-and-conditions
Time period covered
2025
Area covered
Worldwide
Variables measured
Outcome, Call Type, Transcriptions, Audio Recordings, Speaker Metadata, Conversation Topics
Description
Plan memorable weddings with Macgence's English conversation dataset. Ideal for AI, linguistics, and event tech, ensuring relevance and impactful insights!
Data from: PHEME dataset of rumours and non-rumours
figshare.com
bz2
Updated May 30, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Arkaitz Zubiaga; Geraldine Wong Sak Hoi; Maria Liakata; Rob Procter (2023). PHEME dataset of rumours and non-rumours [Dataset]. http://doi.org/10.6084/m9.figshare.4010619.v1
Explore at:
bz2Available download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.4010619.v1
Dataset updated
May 30, 2023
Dataset provided by
figshare
Authors
Arkaitz Zubiaga; Geraldine Wong Sak Hoi; Maria Liakata; Rob Procter
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains a collection of Twitter rumours and non-rumours posted during breaking news. The five breaking news provided with the dataset are as follows:* Charlie Hebdo: 458 rumours (22.0%) and 1,621 non-rumours (78.0%).* Ferguson: 284 rumours (24.8%) and 859 non-rumours (75.2%).* Germanwings Crash: 238 rumours (50.7%) and 231 non-rumours (49.3%).* Ottawa Shooting: 470 rumours (52.8%) and 420 non-rumours (47.2%).* Sydney Siege: 522 rumours (42.8%) and 699 non-rumours (57.2%).The data is structured as follows. Each event has a directory, with two subfolders, rumours and non-rumours. These two folders have folders named with a tweet ID. The tweet itself can be found on the 'source-tweet' directory of the tweet in question, and the directory 'reactions' has the set of tweets responding to that source tweet.This dataset was used in the paper 'Learning Reporting Dynamics during Breaking News for Rumour Detection in Social Media' for rumour detection. For more details, please refer to the paper.License: The annotations are provided under a CC-BY license, while Twitter retains the ownership and rights of the content of the tweets.
PHEME dataset for Rumour Detection and Veracity Classification
figshare.com
application/gzip
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Elena Kochkina; Maria Liakata; Arkaitz Zubiaga (2023). PHEME dataset for Rumour Detection and Veracity Classification [Dataset]. http://doi.org/10.6084/m9.figshare.6392078.v1
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.6392078.v1
Dataset updated
May 30, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
Elena Kochkina; Maria Liakata; Arkaitz Zubiaga
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains a collection of Twitter rumours and non-rumours posted during breaking news.

The data is structured as follows. Each event has a directory, with two subfolders, rumours and non-rumours. These two folders have folders named with a tweet ID. The tweet itself can be found on the 'source-tweet' directory of the tweet in question, and the directory 'reactions' has the set of tweets responding to that source tweet. Also each folder contains ‘annotation.json’ which contains information about veracity of the rumour and ‘structure.json’, which contains information about structure of the conversation.

This dataset is an extension of the PHEME dataset of rumours and non-rumours (https://figshare.com/articles/PHEME_dataset_of_rumours_and_non-rumours/4010619), it contains rumours related to 9 events and each of the rumours is annotated with its veracity value, either True, False or Unverified.

This dataset was used in the paper 'All-in-one: Multi-task Learning for Rumour Verification'. For more details, please refer to the paper.

Code using this dataset can be found on github (https://github.com/kochkinaelena/Multitask4Veracity).

License: The annotations are provided under a CC-BY license, while Twitter retains the ownership and rights of the content of the tweets.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Peking University (2023). MUSIED [Dataset]. https://opendatalab.com/OpenDataLab/MUSIED

MUSIED

OpenDataLab/MUSIED

Explore at:

zip(35559885 bytes)Available download formats

Dataset updated

Feb 5, 2023

Dataset provided by

Peking University

License

https://choosealicense.com/licenses/agpl-3.0/https://choosealicense.com/licenses/agpl-3.0/

Description

We propose a new largescale Chinese event detection dataset based on user reviews, text conversations, and phone conversations in a leading e-commerce platform for food service.

Clear search

Close search

Google apps

Main menu

MUSIED

General conversation speech datasets in English for Wedding Planning

Data from: PHEME dataset of rumours and non-rumours

PHEME dataset for Rumour Detection and Veracity Classification

MUSIEDSee More Versions

OpenDataLab/MUSIED

MUSIED