WHOOPS! Is a dataset and benchmark for visual commonsense. The dataset is comprised of purposefully commonsense-defying images created by designers using publicly-available image generation tools like Midjourney. It contains commonsense-defying image from a wide range of reasons, deviations from expected social norms and everyday knowledge.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
WEIRD
This dataset is used in the paper Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images.
Task description
WEIRD is an extended version of a binary classification subtask of the original English WHOOPS! benchmark. The dataset evaluates the ability to detect violations of commonsense. Commonsense violations are situations that contradict the norm of reality. For example, penguins can't fly, children don't drive cars, guests don't set the table… See the full description on the dataset page: https://huggingface.co/datasets/MERA-evaluation/WEIRD.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
WHOOPS! Is a dataset and benchmark for visual commonsense. The dataset is comprised of purposefully commonsense-defying images created by designers using publicly-available image generation tools like Midjourney. It contains commonsense-defying image from a wide range of reasons, deviations from expected social norms and everyday knowledge.