Facebook
TwitterDisclaimer: We do not own this dataset. DeepFashion dataset is a public dataset which can be accessed through its website. This dataset was used to evaluate Marqo-FashionCLIP and Marqo-FashionSigLIP - see details below.
Marqo-FashionSigLIP Model Card
Marqo-FashionSigLIP leverages Generalised Contrastive Learning (GCL) which allows the model to be trained on not just text descriptions but also categories, style, colors, materials, keywords and fine-details to provide highly relevantโฆ See the full description on the dataset page: https://huggingface.co/datasets/Marqo/deepfashion-multimodal.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
## Overview
Deepfashion is a dataset for object detection tasks - it contains Clothes annotations for 240 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [Public Domain license](https://creativecommons.org/licenses/Public Domain).
Facebook
TwitterDataset Card for "deepfashion"
More Information needed
Facebook
TwitterThe dataset is the re-organized and re-labeled version of the In-shop Clothes Retrieval Benchmark of DeepFashion. It includes 13,752 pairs of images and masks.
The original data was presented in the form of a deep file hierarchy and had to be re-organized as only image and mask folders under the data directory. All masks had three channels, they were reduced to one channel. Not all images had masks in the original dataset. Images without masks were discarded. You can find the script that achieves these tasks here.
Liu, Ziwei and Luo, Ping and Qiu, Shi and Wang, Xiaogang and Tang, Xiaoou, DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016.
Original source: DeepFashion: In-shop Clothes Retrieval
One can find the notebook where this dataset is used.
License info: http://mmlab.ie.cuhk.edu.hk/projects/DeepFashion/DeepFashionAgreement.pdf
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The DeepFashion-MultiModal Dataset is a leading dataset for fashion AI and computer vision, featuring high-quality, aligned images and textual attributes for fashion recognition, retrieval, and generative AI applications.
Facebook
TwitterDeepFashion2 Original (with Dataframes) includes the following: - The original DeepFashion2 Dataset, comprising JSON files with boundary box annotations, segmentation masks, and landmark points. The /annos sub-folder contains these files, while the /images sub-folder holds images for training and validation. The test folder includes images without annotation information. - The dataframes that have been generated through the steps in the notebook Resizing DeepFashion2 (256x256) and Basic EDA, converting all the JSON information for every image into 3 dataframes - train, test and validation for the ease of pre-processing, visualisation, and modelling.
@article{DeepFashion2,
author = {Yuying Ge and Ruimao Zhang and Lingyun Wu and Xiaogang Wang and Xiaoou Tang and Ping Luo},
title={A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images},
journal={CVPR},
year={2019}
}
NOTE - THE CITATION INFORMATION SHOWN BELOW UNDER DOI Citation IS AUTOGENERATED BY KAGGLE. USE THE ABOVE BIBTEX WHILE CITING THE DATASOURCE
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Original data contains 44k. This one only contains images from front.
Text2Human: Text-Driven Controllable Human Image Generation
Yuming Jiang, Shuai Yang, Haonan Qiu, Wayne Wu, Chen Change Loy and Ziwei Liu
In ACM Transactions on Graphics (Proceedings of SIGGRAPH), 2022.
From MMLab@NTU affliated with S-Lab, Nanyang Technological University and SenseTime Research.
https://github.com/yumingj/DeepFashion-MultiModal/raw/main/assets/logo.png">
[Project Page] | [Paper] | [Code] | [Demo Video]
DeepFashion-MultiModal is a large-scale high-quality human dataset with rich multi-modal annotations. It has the following properties: 1. It contains 44,096 high-resolution human images, including 12,701 full body human images. 2. For each full body images, we manually annotate the human parsing labels of 24 classes. 3. For each full body images, we manually annotate the keypoints. 4. We extract DensePose for each human image. 5. Each image is manually annotated with attributes for both clothes shapes and textures. 6. We provide a textual description for each image.
@article{jiang2022text2human,
title={Text2Human: Text-Driven Controllable Human Image Generation},
author={Jiang, Yuming and Yang, Shuai and Qiu, Haonan and Wu, Wayne and Loy, Chen Change and Liu, Ziwei},
journal={ACM Transactions on Graphics (TOG)},
volume={41},
number={4},
articleno={162},
pages={1--11},
year={2022},
publisher={ACM New York, NY, USA},
doi={10.1145/3528223.3530104},
}
@inproceedings{liuLQWTcvpr16DeepFashion,
author = {Liu, Ziwei and Luo, Ping and Qiu, Shi and Wang, Xiaogang and Tang, Xiaoou},
title = {DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations},
booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2016}
}
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
DeepFashion MultiModal Parts2Whole
Dataset Details
Dataset Description
This human image dataset comprising about 41,500 reference-target pairs. Each pair in this dataset includes multiple reference images, which encompass human pose images (e.g., OpenPose, Human Parsing, DensePose), various aspects of human appearance (e.g., hair, face, clothes, shoes) with their short textual labels, and a target image featuring the same individual (ID) in the same outfitโฆ See the full description on the dataset page: https://huggingface.co/datasets/huanngzh/DeepFashion-MultiModal-Parts2Whole.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
DeepFashion2 is a comprehensive fashion dataset. It contains 491K diverse images of 13 popular clothing categories from both commercial shopping stores and consumers. It has a total of 801K clothing items, where each item in an image is labeled with scale, occlusion, zoom-in, viewpoint, category, style, bounding box, dense landmarks, and per-pixel masks. There are also 873K Commercial-Consumer clothes pairs.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22418539%2Fc38fe86b17de5c19d044f4ffb7d7c203%2Fdeepfashion2_bigbang.png?generation=1763420762376839&alt=media" alt="">
The richness of the annotations makes DeepFashion2 suitable for multiple complex tasks, often simultaneously (as in a multi-head model like a Keypoint R-CNN):
bounding_box for each clothing item.segmentation mask for each item.landmarks for each item. This is the core task for virtual try-on and detailed clothing analysis.The dataset is split into a training set (391K images), a validation set (34k images), and a test set (67k images).
Figure 1: Examples of DeepFashion2.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22418539%2Fde20008e6aa8f794c4b30d8333df5b55%2Fannotation.jpg?generation=1763420915372772&alt=media" alt="">
From (1) to (4), each row represents clothes images with different variations. At each row, the left three columns represent clothes from commercial stores, while the right three columns are from customers. In each group, the three images indicate three levels of difficulty with respect to the corresponding variation. Each item is annotated with landmarks and masks.
The data is organized into train, validation, and test sets. Each image (.jpg) has a corresponding annotation file (.json) in the annos directory.
train/imagetrain/annosvalidation/imagevalidation/annostest/imageEach .json annotation file is organized as a dictionary with image-level information and a variable number of item entries.
source: A string, where 'shop' indicates the image is from a commercial store and 'user' indicates the image is from a consumer.pair_id: A number. Images from the same shop and their corresponding consumer-taken images share the same pair_id.item_1, item_2, ... item_n: Each item found in the image has its own entry with the following keys:
category_name: A string indicating the category of the item.category_id: A number (1-13) corresponding to the category name:
short_sleeved_shirtlong_sleeved_shirtshort_sleeved_outwearlong_sleeved_outwearvestslingshortstrousersskirtshort_sleeved_dresslong_sleeved_dressvest_dresssling_dressstyle: A number (0, 1, 2, ...) to distinguish between clothing items from images with the same pair_id. See the "Understanding Pairs" section for more details.bounding_box: [x1, y1, x2, y2] coordinates of the upper-left and lower-right corners.landmarks: [x1, y1, v1, ..., xn, yn, vn], where v is visibility:
v=2: visiblev=1: occlusionv=0: not labeledsegmentation: [[x1, y1, ... xn, yn], [poly2], ...] A list of polygons.scale: 1 (small), 2 (modest), or 3 (large).occlusion: 1 (slight/none), 2 (medium), or 3 (heavy).zoom_in: 1 (no), 2 (medium), or 3 (large).viewpoint: 1 (no wear), 2 (frontal), or 3 (side/back).A total of 294 landmarks are defined across the 13 categories. The numbers in the figure below represent the order of landmark annotations for each category.
Figure 2: Definitions of landmarks and skeletons.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22418539%2Ff3d67659bb9e66b17d6238ace90f1fc2%2Fcls.jpg?generation=1763420957293495&alt=media" alt="">
A key feature of this dataset is the link between "shop" (commercial) and "user" (consumer) images for clothing retrieval.
pair_id are...
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
## Overview
DeepFashion Clothing is a dataset for classification tasks - it contains Clothing annotations for 3,169 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [MIT license](https://creativecommons.org/licenses/MIT).
Facebook
Twitterzhengchong/DeepFashion dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
The above dataset is derived and pre-processded from the DeepFashion Multimodal Dataset for :
https://github.com/yumingj/DeepFashion-MultiModal Text2Human: Text-Driven Controllable Human Image Generation Yuming Jiang, Shuai Yang, Haonan Qiu, Wayne Wu, Chen Change Loy and Ziwei Liu In ACM Transactions on Graphics (Proceedings of SIGGRAPH), 2022.
The above dataset is specifically preprocessed based on Characterisitc Input for Image -Text Pairs consisting: - -Gender Classification - -Feature Engineering - - ViT Input Compose
Dataset Contains : - - fashion_model.pth (Trained on the given dataset for pairing) - - male_fashion (Image Directory for all Male Images) - - female_fashion (Image Directory for all Female Images) - - df_male.csv (Text for all Male Images) - - df_female.csv (Text for all Female Images) - - female_front.csv (Text specified for Front-full body Images) - - features.db (sqlite database with Image-Text pairs)
Facebook
TwitterDataset Card for "deepfashion-multimodal-descriptions"
More Information needed
Facebook
TwitterThanThoai9x/Deepfashion dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset was created by jaoc@jaoc!
Released under MIT
Facebook
TwitterThis dataset was created by Rachit Kumar Singh
Facebook
TwitterDaHaDaHa/deepfashion dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterThis dataset was created by Jayleeli
Facebook
TwitterThis dataset was created by Tuan Nguyen Van Anh
Facebook
TwitterDisclaimer: We do not own this dataset. DeepFashion dataset is a public dataset which can be accessed through its website. This dataset was used to evaluate Marqo-FashionCLIP and Marqo-FashionSigLIP - see details below.
Marqo-FashionSigLIP Model Card
Marqo-FashionSigLIP leverages Generalised Contrastive Learning (GCL) which allows the model to be trained on not just text descriptions but also categories, style, colors, materials, keywords and fine-details to provide highly relevantโฆ See the full description on the dataset page: https://huggingface.co/datasets/Marqo/deepfashion-multimodal.