100+ datasets found

S
Email Spam Statistics 2025: Shocking Insights and Real Risks
sqmagazine.co.uk
Updated Sep 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SQ Magazine (2025). Email Spam Statistics 2025: Shocking Insights and Real Risks [Dataset]. https://sqmagazine.co.uk/spam-statistics/
Explore at:
Dataset updated
Sep 10, 2025
Dataset authored and provided by
SQ Magazine
License
https://sqmagazine.co.uk/privacy-policy/https://sqmagazine.co.uk/privacy-policy/
Time period covered
Jan 1, 2024 - Dec 31, 2025
Area covered
Global
Description
Email, text, and call spam remain major threats nowadays. Nearly half of all daily emails are unwanted, with users worldwide encountering boosted volumes of phishing and scam content. In retail and financial services, spam disrupts customer trust and inflates cybersecurity budgets. Meanwhile, call-based scams cost consumers time and mental strain...
Spam share of global email traffic 2011-2023
statista.com
Updated Sep 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Spam share of global email traffic 2011-2023 [Dataset]. https://www.statista.com/statistics/420400/spam-email-traffic-share-annual/
Explore at:
Dataset updated
Sep 1, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
In 2023, nearly 45.6 percent of all e-mails worldwide were identified as spam, down from almost 49 percent in 2022. While remaining a big part of the e-mail traffic, since 2011, the share of spam e-mails has decreased significantly. In 2023, the highest volume of spam e-mails was registered in May, approximately 50 percent of e-mail traffic worldwide.
Spam e-mail: leading countries of origin of spam 2024
statista.com
Updated Sep 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Spam e-mail: leading countries of origin of spam 2024 [Dataset]. https://www.statista.com/statistics/263086/countries-of-origin-of-spam/
Explore at:
Dataset updated
Sep 11, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2024
Area covered
Worldwide
Description
In 2024, Russia ranked first by its share of unsolicited spam e-mails. Overall, ***** percent of global spam e-mails originated from IPs in Russia. The Mainland China ranked second, with ***** percent. The United States followed, accounting for over *** percent of global unsolicited spam e-mails during the measured period.
Global spam categories 2020
statista.com
Updated Dec 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Global spam categories 2020 [Dataset]. https://www.statista.com/statistics/263452/most-common-content-of-spam-messages-worldwide-by-category/
Explore at:
Dataset updated
Dec 10, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2020
Area covered
Worldwide
Description
In 2020, healthcare-related spam e-mails accounted for nearly 33 percent of total spam volume. Spam e-mails with adult content were the second-most common category, around 27 percent. Dating-related junk mail generated approximately 10 percent of spam messages in the same period.
h
generated-e-mail-spam
huggingface.co
Updated Sep 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Unique Data (2023). generated-e-mail-spam [Dataset]. https://huggingface.co/datasets/UniqueData/generated-e-mail-spam
Explore at:
Dataset updated
Sep 23, 2023
Authors
Unique Data
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
The dataset consists of a CSV file containing of 300 generated email spam messages. Each row in the file represents a separate email message, its title and text. The dataset aims to facilitate the analysis and detection of spam emails. The dataset can be used for various purposes, such as training machine learning algorithms to classify and filter spam emails, studying spam email patterns, or analyzing text-based features of spam messages.
a
SMS Spam Collection Data Set
academictorrents.com
bittorrent
Updated Nov 28, 2015
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tiago A. Almeida and José María Gómez Hidalgo (2015). SMS Spam Collection Data Set [Dataset]. https://academictorrents.com/details/25932ba42d983dd7b4474d8f59ab56cdc25d9107
Explore at:
bittorrent(695379)Available download formats
Dataset updated
Nov 28, 2015
Dataset authored and provided by
Tiago A. Almeida and José María Gómez Hidalgo
License
https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified
Description
==Data Set Information: This corpus has been collected from free or free for research sources at the Internet: -> A collection of 425 SMS spam messages was manually extracted from the Grumbletext Web site. This is a UK forum in which cell phone users make public claims about SMS spam messages, most of them without reporting the very spam message received. The identification of the text of spam messages in the claims is a very hard and time-consuming task, and it involved carefully scanning hundreds of web pages. The Grumbletext Web site is: [Web Link]. -> A subset of 3,375 SMS randomly chosen ham messages of the NUS SMS Corpus (NSC), which is a dataset of about 10,000 legitimate messages collected for research at the Department of Computer Science at the National University of Singapore. The messages largely originate from Singaporeans and mostly from students attending the University. These messages were collected from volunteers who were made aware that their contributions were
SMS Spam Detection Dataset
kaggle.com
Updated Mar 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vishakh Patel (2024). SMS Spam Detection Dataset [Dataset]. https://www.kaggle.com/datasets/vishakhdapat/sms-spam-detection-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 22, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Vishakh Patel
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Description: In an era where communication is predominantly digital, SMS spam poses a significant challenge, cluttering inboxes and sometimes even posing security risks. Our "SMS Spam Detection Dataset" is tailored to empower machine learning enthusiasts, data scientists, and researchers to tackle this pervasive issue using the power of AI. This dataset is meticulously curated to provide a robust foundation for developing and benchmarking spam detection models.

Dataset Overview: The dataset comprises two columns: 'Text' and 'Label', containing the SMS content and corresponding labels ('ham' for regular messages and 'spam' for unsolicited messages), respectively. With a diverse collection of messages, this dataset serves as an ideal playground for exploring various text processing and machine learning techniques.

Potential Uses: Spam Detection Models: Use the dataset to train binary classification models capable of distinguishing between spam and ham messages with high accuracy. Natural Language Processing (NLP) Techniques: Experiment with different NLP methodologies, including tokenization, stemming, lemmatization, and the application of word embeddings or transformers to understand the nuances of SMS language. Feature Engineering: Explore how different features, such as message length, punctuation usage, and keyword frequency, can impact model performance. Model Benchmarking: Compare the effectiveness of various machine learning algorithms, from classical approaches like Naive Bayes and SVM to advanced deep learning models like LSTM and BERT.

Challenges & Opportunities: While the dataset offers a straightforward binary classification task, the real challenge lies in dealing with the nuances of natural language, including slang, abbreviations, and the evolving nature of spam tactics. Innovators in the field can explore advanced techniques like transfer learning and semi-supervised models to push the boundaries of what's possible in spam detection.
t
Data from: Spam Mails Dataset
dbrepo.datalab.tuwien.ac.at
Updated Apr 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bernal, Nicolas (2025). Spam Mails Dataset [Dataset]. http://doi.org/10.82556/bexb-5283
Explore at:
Unique identifier
https://doi.org/10.82556/bexb-5283
Dataset updated
Apr 19, 2025
Authors
Bernal, Nicolas
Time period covered
2025
Description
Preprocessed data derived from the "spam-mails" dataset, containing email messages labeled as spam or ham. Each record includes a unique identifier from the original dataset and an experiment_id indicating its assignment to a specific data split (training, validation, or test) used in this experiment. The email content has been lemmatized and cleaned to remove noise such as punctuation, special characters, and stopwords, ensuring consistent input for embedding and model training. Original data source: https://www.kaggle.com/datasets/venky73/spam-mails-dataset
j
Data from: Persuasion Sentences in Spam Email (PerSentSE)
portalcienciaytecnologia.jcyl.es
Updated 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jáñez-Martino, Francisco; Barrón-Cedeño, Alberto; ALAIZ-RODRÍGUEZ, ROCÍO; González-Castro, Víctor; Jáñez-Martino, Francisco; Barrón-Cedeño, Alberto; ALAIZ-RODRÍGUEZ, ROCÍO; González-Castro, Víctor (2025). Persuasion Sentences in Spam Email (PerSentSE) [Dataset]. https://portalcienciaytecnologia.jcyl.es/documentos/67a9c7c719544708f8c7246c
Explore at:
Dataset updated
2025
Authors
Jáñez-Martino, Francisco; Barrón-Cedeño, Alberto; ALAIZ-RODRÍGUEZ, ROCÍO; González-Castro, Víctor; Jáñez-Martino, Francisco; Barrón-Cedeño, Alberto; ALAIZ-RODRÍGUEZ, ROCÍO; González-Castro, Víctor
Description
How to Access:

To access this dataset, please contact Francisco Janez via email at francisco.janez@unileon.es. Access will be granted based on specific requests.

Purpose:The PerSentSE corpus was developed to study persuasive techniques in spam emails. It includes 130 emails randomly selected from the SpamArchive2122 dataset, which contains over 20,000 spam emails in English.

Methodology:

Segmentation: Emails were divided into sentences using the NLTK library.

Annotation: Eight persuasive techniques, along with a "non-persuasion" class, were identified. Two expert annotators labeled an initial subset of emails to measure inter-annotator agreement, achieving a final acceptable level (γ = 0.63).

Corpus Statistics:

Total sentences: 1,075

Persuasive sentences: 216 (20.1%)

Persuasion Distribution by Email Sections (Table 7):

Subject lines: 35.59% persuasive, with an average of 1.62 techniques.

Greeting section: 54.17% persuasive, averaging 1.46 techniques.

Email body: 82.46% persuasive, with 5.51 techniques on average.

Farewell section: 31.43% persuasive, averaging 1.45 techniques.

Co-occurrence of Techniques (Figure 2):Some persuasive techniques frequently appeared together:

Appeal to Fear/Prejudice with Loaded Language: 25 instances.

Exaggeration/Minimization with Loaded Language: 24 instances.

Appeal to Fear/Prejudice with Exaggeration/Minimization: 20 instances.

Findings:The body section of emails concentrates the highest number of persuasive elements, contrary to earlier studies focusing on subject lines alone. This suggests that spam emails rely heavily on persuasive content in their main text.
Spam Email Classification
kaggle.com
Updated Jul 9, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Somesh Sharma (2020). Spam Email Classification [Dataset]. https://www.kaggle.com/somesh24/spambase/activity
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 9, 2020
Dataset provided by
Kaggle
Authors
Somesh Sharma
Description
SPAM E-mail Database

The “spam” concept is diverse: advertisements for products/websites, make money fast schemes, chain letters, pornography… Our collection of spam e-mails came from our postmaster and individuals who had filed spam. Our collection of non-spam e-mails came from filed work and personal e-mails, and hence the word ‘george’ and the area code ‘650’ are indicators of non-spam. These are useful when constructing a personalized spam filter. One would either have to blind such non-spam indicators or get a very wide collection of non-spam to generate a general purpose spam filter.

Attribute Information:

The last column denotes whether the e-mail was considered spam (1) or not (0), i.e. unsolicited commercial e-mail. Most of the attributes indicate whether a particular word or character was frequently occurring in the e-mail. The run-length attributes (55-57) measure the length of sequences of consecutive capital letters.

For the statistical measures of each attribute, see the end of this file. Here are the definitions of the attributes:

48 continuous real [0,100] attributes of type word_freq_WORD = percentage of words in the e-mail that match WORD, i.e. 100 * (number of times the WORD appears in the e-mail) / total number of words in e-mail. A “word” in this case is any string of alphanumeric characters bounded by non-alphanumeric characters or end-of-string.

6 continuous real [0,100] attributes of type char_freq_CHAR = percentage of characters in the e-mail that match CHAR, i.e. 100 * (number of CHAR occurrences) / total characters in e-mail

1 continuous real [1,…] attribute of type capital_run_length_average = average length of uninterrupted sequences of capital letters

1 continuous integer [1,…] attribute of type capital_run_length_longest = length of longest uninterrupted sequence of capital letters

1 continuous integer [1,…] attribute of type capital_run_length_total = sum of length of uninterrupted sequences of capital letters = total number of capital letters in the e-mail

1 nominal {0,1} class attribute of type spam = denotes whether the e-mail was considered spam (1) or not (0), i.e. unsolicited commercial e-mail.
h
sms_spam
huggingface.co
Updated Aug 28, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UC Irvine (2023). sms_spam [Dataset]. https://huggingface.co/datasets/ucirvine/sms_spam
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 28, 2023
Dataset authored and provided by
UC Irvine
License
https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/
Description
Dataset Card for [Dataset Name]

Dataset Summary

The SMS Spam Collection v.1 is a public set of SMS labeled messages that have been collected for mobile phone spam research. It has one collection composed by 5,574 English, real and non-enconded messages, tagged according being legitimate (ham) or spam.

Supported Tasks and Leaderboards

[More Information Needed]

Languages

English

Dataset Structure Data Instances

[More Information… See the full description on the dataset page: https://huggingface.co/datasets/ucirvine/sms_spam.
t
Spam Mails Dataset - FAIR experiment
test.researchdata.tuwien.ac.at
application/x-hdf5 +3
Updated Apr 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nicolas Bernal; Nicolas Bernal; Nicolas Bernal; Nicolas Bernal (2025). Spam Mails Dataset - FAIR experiment [Dataset]. http://doi.org/10.70124/0e1sf-saz86
Explore at:
application/x-hdf5, png, txt, csvAvailable download formats
Unique identifier
https://doi.org/10.70124/0e1sf-saz86
Dataset updated
Apr 25, 2025
Dataset provided by
TU Wien
Authors
Nicolas Bernal; Nicolas Bernal; Nicolas Bernal; Nicolas Bernal
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Context

The Spam Mail dataset is a collection of 5.171 emails that have been classified as spam or ham (non-spam). This dataset was originally created in 2006 for research purposes in the field of spam detection and filtering using machine learning techniques, specifically a Naive Bayes classifier as described in the paper "Spam Filtering with Naive Bayes - Which Naive Bayes?" by Metsis, Androutsopoulos, and Paliouras.

The data was created using mainly the inbox of 6 users of the company "Enron" for the "ham" emails, and the "spam" emails were collected from various sources, including the SpamAssassin corpus, the Honeypot project, the spam collection of Bruce Guenter, and spam collected by the authors themselves.

The emails were preprocessed to remove any html tags, and emails with non-latin characters were removed to avoid any possible bias since all "ham" emails are written with latin characters.

The original data can be found in CSV format on Kaggle at: https://www.kaggle.com/datasets/venky73/spam-mails-dataset/data

Project description

In this project we will use the Spam Mail dataset to train a Neural Network model to classify emails as spam or ham. The dataset will be further preprocessed to remove any unnecessary characters like stopwords and punctuation.

The emails will also be tokenized and converted into a format suitable for training the model, but this last step will be performed in the code itself so it is not included in the dataset.

Files

In this repository you will find the following files:

- README.md: Project overview, dataset source, structure, and dependency information.

- confusion_matrix.png: A confusion matrix that shows the performance of the model on the test set.

- evaluation_metrics.txt: Text summary of evaluation metrics: accuracy, precision, recall, and F1-score.

- test_predictions.csv: A CSV file that contains the predictions of the model on the test set.

- top_spam_words.png: A bar chart showing the top 10 most frequent words in correctly predicted spam emails.

- spam_classifier.h5: The trained model file, which can be used to make predictions on new emails.
h
email-spam-classification
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Unique Data, email-spam-classification [Dataset]. https://huggingface.co/datasets/UniqueData/email-spam-classification
Explore at:
Authors
Unique Data
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
Email Spam Classification

The dataset consists of a collection of emails categorized into two major classes: spam and not spam. It is designed to facilitate the development and evaluation of spam detection or email filtering systems. The spam emails in the dataset are typically unsolicited and unwanted messages that aim to promote products or services, spread malware, or deceive recipients for various malicious purposes. These emails often contain misleading subject lines… See the full description on the dataset page: https://huggingface.co/datasets/UniqueData/email-spam-classification.
Spam: share of global e-mail traffic monthly 2014-2023
snapriase.com
statista.com
Updated Jun 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Spam: share of global e-mail traffic monthly 2014-2023 [Dataset]. https://snapriase.com/?ref=instantly.ai&_=%2Fstatistics%2F420391%2Fspam-email-traffic-share%2F%23Ukyz32eyY2jt6x%2FTYXg86FNLs8466yMq
Explore at:
Dataset updated
Jun 23, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Jan 2014 - Dec 2023
Area covered
Worldwide
Description
Spam messages accounted for over **** percent of e-mail traffic in December 2023. Russia generated the largest share of unsolicited spam e-mails in 2022, with **** percent of global spam e-mails originating from the country. Spam worldwide It is almost impossible to think about e-mail without considering the issue of spam, which usually includes billions of promotional e-mails marketers send daily. As of January 2023, the United States had the highest number of spam e-mails sent daily. While many e-mail users believe such content belongs in their spam folder, marketing e-mails are generally harmless if annoying to the user. Malicious spam Phishing e-mails remain one of the primary attack vectors for cybercriminals. On average, around ** percent of businesses worldwide experience four to six successful cyber attacks in one year. Another ** percent said they became victims of more than ** bulk phishing attacks. More than half of the companies said these phishing attacks resulted in consumer or client data breaches.
h
spam-text-messages-dataset
huggingface.co
Updated Jul 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Unique Data (2023). spam-text-messages-dataset [Dataset]. https://huggingface.co/datasets/UniqueData/spam-text-messages-dataset
Explore at:
Dataset updated
Jul 31, 2023
Authors
Unique Data
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
The SMS spam dataset contains a collection of text messages. The dataset includes a diverse range of spam messages, including promotional offers, fraudulent schemes, phishing attempts, and other forms of unsolicited communication. Each SMS message is represented as a string of text, and each entry in the dataset also has a link to the corresponding screenshot. The dataset's content represents real-life examples of spam messages that users encounter in their everyday communication.
Facebook: spam content removal 2017-2025
statista.com
Updated Sep 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Facebook: spam content removal 2017-2025 [Dataset]. https://www.statista.com/statistics/1013843/facebook-spam-content-removal-quarter/
Explore at:
Dataset updated
Sep 4, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
Facebook removed 165 million pieces of spam in the second quarter of 2025, down from 366 million pieces in the previous quarter. The fourth quarter of 2019 saw almost three billion pieces of spam being removed from the social network. Meta Platforms state that spam is not allowed on Facebook, and defines spam as deceptive or annoying content used to drive engagement.
h
all-scam-spam
huggingface.co
Updated Sep 2, 2002
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fred Zhang (2002). all-scam-spam [Dataset]. https://huggingface.co/datasets/FredZhang7/all-scam-spam
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 2, 2002
Authors
Fred Zhang
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This is a large corpus of 42,619 preprocessed text messages and emails sent by humans in 43 languages. is_spam=1 means spam and is_spam=0 means ham. 1040 rows of balanced data, consisting of casual conversations and scam emails in ≈10 languages, were manually collected and annotated by me, with some help from ChatGPT.

Some preprcoessing algorithms

spam_assassin.js, followed by spam_assassin.py enron_spam.py

Data composition Description

To make the text… See the full description on the dataset page: https://huggingface.co/datasets/FredZhang7/all-scam-spam.
E-mail spam rate worldwide 2012-2018
statista.com
Updated Jul 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2022). E-mail spam rate worldwide 2012-2018 [Dataset]. https://www.statista.com/statistics/270899/global-e-mail-spam-rate/
Explore at:
Dataset updated
Jul 7, 2022
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
The statistic shows the global e-mail spam rate from 2012 to 2018. In the most recently observed period, it was found that spam accounted for 55 percent of all e-mail messages, same as during the previous year.
Data from: Spam Mail Dataset
kaggle.com
Updated Sep 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chirag Singh (2025). Spam Mail Dataset [Dataset]. https://www.kaggle.com/datasets/cschiragsingh999/spam-mail-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 8, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Chirag Singh
Description
Dataset

This dataset was created by Chirag Singh

Contents

Spam Mail Dataset
c
spam Price Prediction Data
coinbase.com
Updated Oct 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). spam Price Prediction Data [Dataset]. https://www.coinbase.com/price-prediction/base-spam-4053
Explore at:
Dataset updated
Oct 18, 2025
Variables measured
Growth Rate, Predicted Price
Measurement technique
User-defined projections based on compound growth. This is not a formal financial forecast.
Description
This dataset contains the predicted prices of the asset spam over the next 16 years. This data is calculated initially using a default 5 percent annual growth rate, and after page load, it features a sliding scale component where the user can then further adjust the growth rate to their own positive or negative projections. The maximum positive adjustable growth rate is 100 percent, and the minimum adjustable growth rate is -100 percent.

Facebook

Twitter

Click to copy link

Link copied

Cite

SQ Magazine (2025). Email Spam Statistics 2025: Shocking Insights and Real Risks [Dataset]. https://sqmagazine.co.uk/spam-statistics/

Email Spam Statistics 2025: Shocking Insights and Real Risks

Explore at:

Dataset updated

Sep 10, 2025

Dataset authored and provided by

SQ Magazine

License

https://sqmagazine.co.uk/privacy-policy/https://sqmagazine.co.uk/privacy-policy/

Time period covered

Jan 1, 2024 - Dec 31, 2025

Area covered

Global

Description

Email, text, and call spam remain major threats nowadays. Nearly half of all daily emails are unwanted, with users worldwide encountering boosted volumes of phishing and scam content. In retail and financial services, spam disrupts customer trust and inflates cybersecurity budgets. Meanwhile, call-based scams cost consumers time and mental strain...

Clear search

Close search

Google apps

Main menu

Email Spam Statistics 2025: Shocking Insights and Real Risks

Spam share of global email traffic 2011-2023

Spam e-mail: leading countries of origin of spam 2024

Global spam categories 2020

generated-e-mail-spam

SMS Spam Collection Data Set

SMS Spam Detection Dataset

Data from: Spam Mails Dataset

Data from: Persuasion Sentences in Spam Email (PerSentSE)

Spam Email Classification

sms_spam

Spam Mails Dataset - FAIR experiment

Context

Project description

Files

email-spam-classification

Spam: share of global e-mail traffic monthly 2014-2023

spam-text-messages-dataset

Facebook: spam content removal 2017-2025

all-scam-spam

E-mail spam rate worldwide 2012-2018

Data from: Spam Mail Dataset

Dataset

Contents

spam Price Prediction Data

Email Spam Statistics 2025: Shocking Insights and Real Risks