FaceForensics++ is a forensics dataset consisting of 1000 original video sequences that have been manipulated with four automated face manipulation methods: Deepfakes, Face2Face, FaceSwap, and NeuralTextures. The data was sourced from 977 YouTube videos; all videos contain a trackable, mostly frontal face without occlusions, which enables the automated tampering methods to generate realistic forgeries.
FaceForensics is a video dataset consisting of more than 500,000 frames containing faces from 1004 videos that can be used to study image or video forgeries. All videos were downloaded from YouTube and cut down to short continuous clips containing mostly frontal faces. This dataset has two versions:
Source-to-Target: the authors reenact over 1000 videos with new facial expressions extracted from other videos, which can be used, e.g., to train a classifier to detect fake images or videos.
Self-reenactment: the authors use Face2Face to reenact the facial expressions of videos with their own facial expressions as input, yielding pairs of videos that can be used, e.g., to train supervised generative refinement models.
This is the FaceForensics++ dataset downloaded with the original scripts. The dataset contains the following folders: DeepFakeDetection, Deepfakes, Face2Face, FaceShifter, FaceSwap, NeuralTextures, original, and csv. In total there are 7010 files: 7000 MP4 videos (6000 manipulated, 1000 real) and 10 CSV files.
DeepFakeDetection - 1000 deepfake videos.
Deepfakes - 1000 deepfake videos.
Face2Face - 1000 deepfake videos.
FaceShifter - 1000 deepfake videos.
FaceSwap - 1000 deepfake videos.
NeuralTextures - 1000 deepfake videos.
original - 1000 real videos.
csv - 10 csv files with metadata about the videos.
The videos were downloaded at moderate compression using the c23 option (c23 corresponds to H.264 compression at a constant rate factor, CRF, of 23).
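To sanity-check a local copy against the counts above, a short Python sketch can walk the folders and tally the files. The root path below is a hypothetical local directory name, not part of the dataset:

```python
import os

ROOT = "FaceForensics++"  # hypothetical path to the downloaded dataset

# The seven video folders listed above; each should hold 1000 mp4 files.
VIDEO_DIRS = ["DeepFakeDetection", "Deepfakes", "Face2Face",
              "FaceShifter", "FaceSwap", "NeuralTextures", "original"]

for d in VIDEO_DIRS:
    n = sum(f.endswith(".mp4") for f in os.listdir(os.path.join(ROOT, d)))
    print(f"{d}: {n} videos")

# The csv folder should hold the 10 metadata files.
n_csv = sum(f.endswith(".csv") for f in os.listdir(os.path.join(ROOT, "csv")))
print(f"csv: {n_csv} files")
```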
https://github.com/ondyari/FaceForensics
http://niessnerlab.org/projects/roessler2018faceforensics.html
https://github.com/ondyari/FaceForensics/blob/master/LICENSE
https://kaldir.vc.in.tum.de/faceforensics/webpage/FaceForensics_TOS.pdf
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
With the advancement of deep forgery techniques, particularly those propelled by generative adversarial networks (GANs), identifying deepfake faces has become increasingly challenging. Although existing forgery detection methods can identify tampering details within manipulated images, their effectiveness diminishes significantly in complex scenes, especially in low-quality images subjected to compression. To address this issue, we propose a novel deep face forgery video detection model named the Two-Stream Feature Domain Fusion Network (TSFF-Net). The model comprises spatial and frequency domain feature extraction branches, a feature extraction layer, and a Transformer layer. In the feature extraction module, we utilize the Scharr operator to extract edge features from facial images while also integrating frequency domain information from these images. This combination enhances the model’s ability to detect low-quality deepfake videos. Experimental results demonstrate the superiority of our method, achieving detection accuracies of 97.7%, 91.0%, 98.9%, and 90.0% on the FaceForensics++ dataset for Deepfake, Face2Face, FaceSwap, and NeuralTextures forgeries, respectively. Additionally, our model exhibits promising results in cross-dataset experiments. The code used in this study is available at: https://github.com/hwZHc/TSFF-Net.git.
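The authors' implementation is in the linked repository; the sketch below only illustrates the two input streams the abstract describes, i.e., a Scharr edge map for the spatial branch and a log-magnitude spectrum for the frequency branch, using OpenCV and NumPy. The face-crop filename is a placeholder, and how the two maps are fused downstream is left to the model itself:

```python
import cv2
import numpy as np

def two_stream_features(face_bgr):
    """Compute the two input maps described above for one aligned face crop."""
    gray = cv2.cvtColor(face_bgr, cv2.COLOR_BGR2GRAY).astype(np.float32)

    # Spatial branch: Scharr gradients in x and y, combined into an edge magnitude map.
    gx = cv2.Scharr(gray, cv2.CV_32F, 1, 0)
    gy = cv2.Scharr(gray, cv2.CV_32F, 0, 1)
    edges = cv2.magnitude(gx, gy)

    # Frequency branch: centered 2-D FFT; the log-magnitude spectrum tends to
    # expose compression and upsampling artifacts even in low-quality frames.
    spectrum = np.fft.fftshift(np.fft.fft2(gray))
    freq = np.log1p(np.abs(spectrum)).astype(np.float32)

    return edges, freq

face = cv2.imread("face_crop.png")  # placeholder: an aligned face crop
edge_map, freq_map = two_stream_features(face)
```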
This dataset was created by Vittorio Stile
Released under Other (specified in description)
This dataset was created by Siddhant Baiswar
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Results of cross-database experiments on the Celeb-DF dataset (AUC).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Ablation study results for branching strategies (ACC).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
such as FaceForensics++ and DFDC
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
As deepfakes pose increasing threats to media authenticity and public trust, accurate detection mechanisms are vital. This paper proposes a novel dual-stage transformer model for detecting deepfake facial videos. The model combines a spatial attention transformer to capture facial features across individual frames and a contextual consistency transformer to track identity coherence across time. Results from benchmark datasets (FaceForensics++, DFDC) show this architecture significantly outperforms conventional CNNs and RNNs in both accuracy and generalization. This research offers a pathway toward more robust and interpretable video forensics solutions.
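The paper's exact architecture is not reproduced here; the following is a minimal PyTorch sketch of the dual-stage idea only: self-attention over patches within each frame, followed by self-attention over frame embeddings across time. All dimensions, the patch embedding, and the pooling choices are illustrative assumptions:

```python
import torch
import torch.nn as nn

class DualStageSketch(nn.Module):
    """Stage 1: spatial attention within frames; stage 2: temporal consistency."""
    def __init__(self, dim=256, heads=4, depth=2):
        super().__init__()
        self.patch = nn.Conv2d(3, dim, kernel_size=16, stride=16)  # crude patch embedding
        self.spatial = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, heads, batch_first=True), depth)
        self.temporal = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, heads, batch_first=True), depth)
        self.head = nn.Linear(dim, 1)  # real/fake logit

    def forward(self, clip):                  # clip: (B, T, 3, H, W)
        b, t = clip.shape[:2]
        x = self.patch(clip.flatten(0, 1))    # (B*T, dim, h, w)
        x = x.flatten(2).transpose(1, 2)      # (B*T, patches, dim)
        x = self.spatial(x).mean(dim=1)       # one embedding per frame
        x = self.temporal(x.view(b, t, -1))   # attend across the T frames
        return self.head(x.mean(dim=1))       # (B, 1)

logits = DualStageSketch()(torch.randn(2, 8, 3, 224, 224))  # toy clip batch
```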
https://github.com/deepfakeinthewild/deepfake-in-the-wild
In recent years, the abuse of a face swap technique called Deepfake has raised enormous public concerns. Deepfake leverages deep learning techniques to replace one person's face in a video with someone else's without leaving obvious traces. So far, a large number of deepfake videos (also known as "deepfakes") have been crafted and uploaded to the internet, which calls for the development of effective countermeasures. One promising countermeasure against deepfakes is deepfake detection. Several deepfake datasets have been released to support the training and testing of deepfake detectors, such as DeepfakeDetection and FaceForensics++. While this has greatly advanced deepfake detection, most of the real videos in these datasets are filmed with a few volunteer actors in limited scenes, and the fake videos are crafted by researchers using a few popular deepfake software tools. Detectors developed on these datasets may lose effectiveness when applied to detect the vast variety of deepfake videos in the wild (those uploaded to various video-sharing websites). To better support detection against real-world deepfakes, in this paper, we introduce a new dataset, WildDeepfake, which consists of 7,314 face sequences extracted from 707 deepfake videos collected entirely from the internet. WildDeepfake is a small dataset that can be used, in addition to existing datasets, to develop more effective detectors against real-world deepfakes. We conduct a systematic evaluation of a set of baseline detection networks on both existing datasets and WildDeepfake, and show that WildDeepfake is indeed a more challenging dataset, on which detection performance can decrease drastically. We also propose two (2D and 3D) Attention-based Deepfake Detection Networks (ADDNets) to leverage attention masks on real/fake faces for improved detection. We empirically verify the effectiveness of ADDNets on both existing datasets and WildDeepfake.
The DFDC (Deepfake Detection Challenge) is a dataset for deepfake detection consisting of more than 100,000 videos.
The DFDC dataset consists of two versions:
Preview dataset - 5k videos, featuring two facial modification algorithms.
Full dataset - 124k videos, featuring eight facial modification algorithms.
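For the full dataset, each training part ships a metadata.json that maps a video filename to its label ("REAL" or "FAKE") and, for fakes, the source video. A small sketch for collecting the labels of one part (the directory name is hypothetical):

```python
import json
from pathlib import Path

part = Path("dfdc_train_part_0")  # hypothetical path to one DFDC training part
meta = json.loads((part / "metadata.json").read_text())

labels = {name: info["label"] for name, info in meta.items()}
n_fake = sum(1 for v in labels.values() if v == "FAKE")
print(f"{len(labels)} videos, {n_fake} fake")
```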
We build a hybrid fake face (HFF) dataset, which contains eight types of face images. For the real face images, three types are randomly selected from three open datasets: low-resolution face images from CelebA, high-resolution face images from CelebA-HQ, and face video frames from FaceForensics. Real face images under internet scenarios are thus simulated as realistically as possible. Then, some of the most representative face manipulation techniques, including PGGAN and StyleGAN for identity manipulation, Face2Face and Glow for face expression manipulation, and StarGAN for face attribute transfer, are used to produce the fake face images. The HFF dataset is a large fake face dataset containing more than 155k face images.
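A labeled index for a hybrid dataset like this can be assembled in a few lines of Python, assuming one folder per source subset; all folder names below are hypothetical, not the HFF release layout:

```python
import os
import csv

# Hypothetical folder-per-subset layout: three real sources, five fake sources.
SOURCES = {
    "celeba": 0, "celeba_hq": 0, "faceforensics_frames": 0,              # real
    "pggan": 1, "stylegan": 1, "face2face": 1, "glow": 1, "stargan": 1,  # fake
}

with open("hff_index.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["path", "label"])
    for folder, label in SOURCES.items():
        for name in os.listdir(folder):
            writer.writerow([os.path.join(folder, name), label])
```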