5 datasets found
  1. MS1MV2 112x112

    • kaggle.com
    Updated Dec 9, 2024
    Cite
    yakhyo (2024). MS1MV2 112x112 [Dataset]. https://www.kaggle.com/datasets/yakhyokhuja/ms1m-arcface-dataset
    Dataset updated
    Dec 9, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    yakhyo
    Description

    Please see https://github.com/yakhyo/face-recognition to train Face Recognition model.

    MS1M-ArcFace Dataset Description

    The MS1M-ArcFace dataset is a cleaned and refined version of the original MS-Celeb-1M dataset, specifically curated for face recognition tasks. This dataset was processed to remove noisy and misaligned images, improving its quality and usability in training robust face recognition models.

    Features
    - Image Size: 112x112 pixels
    - Classes: 85742
    - Aligned: Standardized facial landmarks

    Info
    - Dataset Origin: Based on the MS-Celeb-1M dataset, originally released by Microsoft Research.
    - Purpose: Designed to facilitate research and development in face recognition, particularly for high-accuracy models.
    - Data: Contains millions of images of celebrity faces, preprocessed and aligned for optimal model training.
    - Preprocessing: Cleaned and refined using advanced methods to reduce noise, mislabels, and inaccuracies.
    - Applications: Used in training state-of-the-art models like ArcFace for tasks such as identity verification, facial feature extraction, and more.
    - License: Users should verify compliance with ethical and licensing requirements before using or distributing the dataset.
    - This dataset has been extensively used in academic and industrial research for benchmarking and developing cutting-edge face recognition systems.

    Please refer to the original source of this dataset for additional information. It's released here for academic purposes only.
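
    The features listed above (112x112 images, 85742 classes) can be turned into a quick sanity check when loading samples. The following is only a minimal sketch: `FaceDatasetSpec` and `check_sample` are hypothetical names, and the zero-based contiguous label range and the (height, width, channels) shape layout are assumptions that the description does not confirm.

    ```python
    from dataclasses import dataclass


    @dataclass(frozen=True)
    class FaceDatasetSpec:
        """Properties stated in the dataset description."""
        height: int = 112
        width: int = 112
        num_classes: int = 85742


    def check_sample(shape, label, spec=FaceDatasetSpec()):
        """Return a list of problems found for one (image shape, label) pair.

        `shape` is assumed to be (height, width, channels). The channel count
        is not stated in the description, so it is not checked here.
        Labels are assumed to be integers in [0, num_classes).
        """
        problems = []
        if tuple(shape[:2]) != (spec.height, spec.width):
            problems.append(f"unexpected image size {shape[0]}x{shape[1]}")
        if not 0 <= label < spec.num_classes:
            problems.append(f"label {label} outside [0, {spec.num_classes})")
        return problems


    print(check_sample((112, 112, 3), 0))      # a sample matching the stated features
    print(check_sample((128, 128, 3), 85742))  # wrong size and out-of-range label
    ```

    An empty list means the sample is consistent with the stated features; it says nothing about alignment quality or label correctness.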

  2. Data from: MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face...

    • academictorrents.com
    bittorrent
    Updated Jun 4, 2019
    Cite
    Yandong Guo and Lei Zhang and Yuxiao Hu and Xiaodong He and Jianfeng Gao (2019). MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition [Dataset]. https://academictorrents.com/details/9e67eb7cc23c9417f39778a8e06cca5e26196a97
    Available download formats: bittorrent (246390693904)
    Dataset updated
    Jun 4, 2019
    Dataset authored and provided by
    Yandong Guo and Lei Zhang and Yuxiao Hu and Xiaodong He and Jianfeng Gao
    License

    https://academictorrents.com/nolicensespecified

    Description

    In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and linking them to corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data. The rich information provided by the knowledge base helps to conduct disambiguation and improve the recognition accuracy, and contributes to various real-world applications, such as image captioning and news video analysis. Associated with this task, we design and provide a concrete measurement set, evaluation protocol, as well as training data. We also present in detail our experiment setup and report promising baseline results. Our benchmark task could lead to one of the largest classification problems in computer vision. To the best of our knowledge, our training dataset, which contains 10M images in version 1, is th

  3. Faces ms1m-refine-v2_112x112 TFRecord

    • kaggle.com
    Updated Jun 3, 2023
    Cite
    Jason Wong (2023). Faces ms1m-refine-v2_112x112 TFRecord [Dataset]. https://www.kaggle.com/datasets/jasonhcwong/faces-ms1m-refine-v2-112x112-tfrecord/discussion
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Jason Wong
    Description

    This is MS1M-refine-v2 (a.k.a. MS1M-ArcFace) dataset for facial recognition in TFRecord dataset format. It is from InsightFace DatasetZoo.

  4. ms1m-100-v2

    • kaggle.com
    Updated Dec 18, 2023
    Cite
    emIkhlas (2023). ms1m-100-v2 [Dataset]. https://www.kaggle.com/datasets/emikhlas/ms1m-100-v2/code
    Dataset updated
    Dec 18, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    emIkhlas
    Description

    Dataset

    This dataset was created by emIkhlas

    Contents

  5. Glint360K face recognition dataset

    • academictorrents.com
    bittorrent
    Updated Aug 13, 2022
    Cite
    None (2022). Glint360K face recognition dataset [Dataset]. https://academictorrents.com/details/e5f46ee502b9e76da8cc3a0e4f7c17e4000c7b1e
    Available download formats: bittorrent (128583192913)
    Dataset updated
    Aug 13, 2022
    Authors
    None
    License

    https://academictorrents.com/nolicensespecified

    Description

    Glint360K contains 17091657 images of 360232 individuals. By employing the Partial FC training strategy, baseline models trained on Glint360K can easily achieve state-of-the-art performance. Detailed evaluation results on the large-scale test sets (e.g. IFRT, IJB-C and Megaface) are as follows:

    1. Evaluation on IFRT

    r denotes the sampling rate of negative class centers.

    | Backbone | Dataset | African | Caucasian | Indian | Asian | ALL   |
    | -------- | ------- | ------- | --------- | ------ | ----- | ----- |
    | R50      | MS1M-V3 | 76.24   | 86.21     | 84.44  | 37.43 | 71.02 |
    | R124     | MS1M-V3 | 81.08   | 89.06     | 87.53  | 38.40 | 74.76 |
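
    To illustrate what a sampling rate of negative class centers means, here is a simplified sketch of the selection step as it is commonly described for this training strategy: every class present in the current batch (a positive) is kept, and negatives are drawn at random until roughly `rate * num_classes` centers are selected. The function name `sample_class_centers` and its parameters are hypothetical, the code uses only the Python standard library, and it is not taken from the dataset authors' implementation.

    ```python
    import random


    def sample_class_centers(num_classes, batch_labels, rate, rng=None):
        """Pick the class centers used for one training step.

        All classes that occur in `batch_labels` (positives) are kept.
        Negative classes are sampled uniformly without replacement so that
        the total is int(rate * num_classes), unless the positives alone
        already exceed that number.
        """
        if not 0.0 < rate <= 1.0:
            raise ValueError("rate must be in (0, 1]")
        rng = rng or random.Random()
        positives = set(batch_labels)
        num_sample = int(rate * num_classes)
        num_negatives = max(num_sample - len(positives), 0)
        # Candidate negatives are all classes absent from the batch.
        negatives_pool = [c for c in range(num_classes) if c not in positives]
        negatives = rng.sample(negatives_pool, num_negatives)
        return sorted(positives | set(negatives))


    # Toy example: 1000 classes, a batch touching three distinct identities.
    kept = sample_class_centers(1000, [3, 3, 42, 977], rate=0.1, rng=random.Random(0))
    print(len(kept), all(c in kept for c in (3, 42, 977)))
    ```

    With the 360232 identities stated above, a lower rate would shrink the set of centers compared in each step accordingly; the exact rates behind the reported numbers are not given in this excerpt.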

