2 datasets found
  1. MS-Celeb-1M

    • opendatalab.com
    zip
    Updated Sep 9, 2022
    Cite
    Microsoft (2022). MS-Celeb-1M [Dataset]. https://opendatalab.com/OpenDataLab/MS-Celeb-1M
    Available download formats: zip (246390692744 bytes)
    Dataset updated
    Sep 9, 2022
    Dataset provided by
    Microsoft (http://microsoft.com/)
    Description

    Microsoft Celeb (MS-Celeb-1M) is a dataset of 10 million face images harvested from the Internet for the purpose of developing face recognition technologies. According to Microsoft Research, which created and published the dataset in 2016, MS Celeb is the largest publicly available face recognition dataset in the world, containing over 10 million images of nearly 100,000 individuals. Microsoft's goal in building this dataset was to distribute an initial training dataset of 100,000 individuals' biometric data to accelerate research into recognizing a larger target list of one million people "using all the possibly collected face images of this individual on the web as training data".

  2. Data from: MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face...

    • academictorrents.com
    bittorrent
    Updated Jun 4, 2019
    Cite
    Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, and Jianfeng Gao (2019). MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition [Dataset]. https://academictorrents.com/details/9e67eb7cc23c9417f39778a8e06cca5e26196a97
    Available download formats: bittorrent (246390693904)
    Dataset updated
    Jun 4, 2019
    Dataset authored and provided by
    Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, and Jianfeng Gao
    License

    https://academictorrents.com/nolicensespecified

    Description

    In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and linking them to corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data. The rich information provided by the knowledge base helps to conduct disambiguation and improve the recognition accuracy, and contributes to various real-world applications, such as image captioning and news video analysis. Associated with this task, we design and provide a concrete measurement set, an evaluation protocol, and training data. We also present our experiment setup in detail and report promising baseline results. Our benchmark task could lead to one of the largest classification problems in computer vision. To the best of our knowledge, our training dataset, which contains 10M images in version 1, is th

