License: http://www.gnu.org/licenses/lgpl-3.0.html
Dataset Name: Spam Email Dataset
Description: This dataset contains a collection of email text messages, labeled as either spam or not spam. Each email message is associated with a binary label, where "1" indicates that the email is spam, and "0" indicates that it is not spam. The dataset is intended for use in training and evaluating spam email classification models.
Columns:
text (Text): This column contains the text content of the email messages. It includes the body of the emails along with any associated subject lines or headers.
spam_or_not (Binary): This column contains binary labels to indicate whether an email is spam or not. "1" represents spam, while "0" represents not spam.
Usage: This dataset can be used for various Natural Language Processing (NLP) tasks, such as text classification and spam detection. Researchers and data scientists can train and evaluate machine learning models using this dataset to build effective spam email filters.
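As a rough illustration of the usage described above, the sketch below trains a tiny word-count Naive Bayes classifier on rows shaped like the documented columns (`text`, `spam_or_not`). The four sample rows are invented for the example and are not taken from the dataset, and Naive Bayes is only one possible choice of model; the source does not prescribe any particular method.

```python
import math
import re
from collections import Counter

# Toy rows in the documented layout (text, spam_or_not). These examples are
# invented for illustration and are NOT taken from the dataset.
ROWS = [
    {"text": "Win a free prize now, click here", "spam_or_not": 1},
    {"text": "Free money offer, claim your prize", "spam_or_not": 1},
    {"text": "Meeting moved to Monday at noon", "spam_or_not": 0},
    {"text": "Please review the attached project report", "spam_or_not": 0},
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(rows):
    """Count words per class and documents per class."""
    word_counts = {0: Counter(), 1: Counter()}
    doc_counts = Counter()
    for row in rows:
        label = int(row["spam_or_not"])
        doc_counts[label] += 1
        word_counts[label].update(tokenize(row["text"]))
    vocab = set(word_counts[0]) | set(word_counts[1])
    return word_counts, doc_counts, vocab


def predict(text, model):
    """Return 1 (spam) or 0 (not spam) using add-one smoothed log scores."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in (0, 1):
        total_words = sum(word_counts[label].values())
        score = math.log(doc_counts[label] / total_docs)
        for token in tokenize(text):
            if token in vocab:  # skip words never seen during training
                count = word_counts[label][token]
                score += math.log((count + 1) / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


model = train(ROWS)
print(predict("claim your free prize", model))
print(predict("project meeting on Monday", model))
```

With the real data, the rows could be read with `csv.DictReader` instead of the inline list, provided the file uses the two column names given above; the file name itself is not stated in the description.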
This dataset was created by Dibyajit dhara
The Email Thread Dataset consists of two main files: email_thread_details and email_thread_summaries. These files collectively offer a comprehensive compilation of email thread information alongside human-generated summaries.
The email_thread_details file provides a detailed perspective on individual email threads, encompassing crucial information such as subject, timestamp, sender, recipients, and the content of the email.
thread_id: A unique identifier for each email thread.
subject: Subject of the email thread.
timestamp: Timestamp indicating when the message was sent.
from: Sender of the email.
to: List of recipients of the email.
body: Content of the email message.
The "to" column is available in both CSV and Pickle (pkl) formats, facilitating convenient access to recipient information as a column of lists of strings.
The email_thread_summaries file contains concise summaries crafted by human annotators for each email thread, offering a high-level overview of the content.
thread_id: A unique identifier for each email thread.
summary: A concise summary of the email thread.
The dataset is organized into threads and emails. There are a total of 4,167 threads and 21,684 emails, providing a rich source of information for analysis and research purposes.
JSON Files:
JSON File Features Description
email_thread_details.json:
[
  {
    "thread_id": [unique identifier],
    "subject": "[email thread subject]",
    "timestamp": [timestamp in milliseconds],
    "from": "[sender's name and identifier]",
    "to": [
      "[recipient 1]",
      "[recipient 2]",
      "[recipient 3]",
      ...
    ],
    "body": "[email content]"
  },
  ...
]
email_thread_summaries.json:
[
  {
    "thread_id": [unique identifier],
    "summary": "[summary content]"
  },
  ...
]
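The two JSON layouts above share the thread_id field, so emails can be grouped into threads and paired with their summaries. The following is a minimal sketch under stated assumptions: the sample records are invented, thread_id is assumed to be an integer (the description only says "unique identifier"), and the helper names `build_threads` and `to_datetime` are hypothetical.

```python
import json
from collections import defaultdict
from datetime import datetime, timezone

# Hypothetical sample records that follow the documented JSON layouts.
DETAILS_JSON = """[
  {"thread_id": 1, "subject": "Budget", "timestamp": 1000000000000,
   "from": "Alice", "to": ["Bob", "Carol"], "body": "Draft attached."},
  {"thread_id": 1, "subject": "Budget", "timestamp": 1000000060000,
   "from": "Bob", "to": ["Alice"], "body": "Looks good."},
  {"thread_id": 2, "subject": "Lunch", "timestamp": 1000000120000,
   "from": "Carol", "to": ["Alice"], "body": "Noon?"}
]"""
SUMMARIES_JSON = """[
  {"thread_id": 1, "summary": "Alice shares a budget draft; Bob approves."},
  {"thread_id": 2, "summary": "Carol proposes lunch at noon."}
]"""


def build_threads(details, summaries):
    """Group emails by thread_id and attach the matching summary."""
    summary_by_id = {s["thread_id"]: s["summary"] for s in summaries}
    emails_by_id = defaultdict(list)
    for email in details:
        emails_by_id[email["thread_id"]].append(email)
    threads = {}
    for thread_id, emails in emails_by_id.items():
        emails.sort(key=lambda e: e["timestamp"])  # oldest message first
        threads[thread_id] = {
            "emails": emails,
            "summary": summary_by_id.get(thread_id),  # None if missing
        }
    return threads


def to_datetime(timestamp_ms):
    """Convert a millisecond timestamp to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(timestamp_ms / 1000, tz=timezone.utc)


threads = build_threads(json.loads(DETAILS_JSON), json.loads(SUMMARIES_JSON))
for thread_id, thread in threads.items():
    first_sent = to_datetime(thread["emails"][0]["timestamp"])
    print(thread_id, len(thread["emails"]), first_sent.date(), thread["summary"])
```

For the actual files, the inline strings would be replaced by `json.load` calls on email_thread_details.json and email_thread_summaries.json.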
Dataset
├── CSV
│   ├── email_thread_details.csv
│   └── email_thread_summaries.csv
├── Pickle
│   ├── email_thread_details.pkl
│   └── email_thread_summaries.pkl
└── JSON
    ├── email_thread_details.json
    └── email_thread_summaries.json
This dataset is provided under the MIT License.
The dataset has been anonymized and sanitized to ensure privacy and confidentiality.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is a temporal hypergraph dataset, which here means a sequence of timestamped hyperedges where each hyperedge is a set of nodes. In email communication, messages can be sent to multiple recipients. In this dataset, nodes are email addresses at Enron, and a hyperedge consists of the sender and all recipients of an email. Only email addresses from a core set of employees are included. Timestamps are in ISO 8601 format.
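To make the hyperedge definition concrete, here is a small sketch that turns email records into timestamped hyperedges, each a set containing the sender and all recipients. The addresses, times, and field names (`time`, `sender`, `recipients`) are invented for the example; the actual file layout of the dataset is not described here.

```python
from datetime import datetime

# Hypothetical messages; the addresses and times are invented.
MESSAGES = [
    {"time": "2001-05-01T09:30:00", "sender": "a@enron.com",
     "recipients": ["b@enron.com", "c@enron.com"]},
    {"time": "2001-05-01T08:15:00", "sender": "b@enron.com",
     "recipients": ["a@enron.com"]},
    {"time": "2001-05-02T14:00:00", "sender": "c@enron.com",
     "recipients": ["a@enron.com", "b@enron.com", "c@enron.com"]},
]


def to_hyperedges(messages):
    """Return (timestamp, node set) pairs sorted by time.

    Each hyperedge is the set of the sender plus all recipients, so an
    address that appears twice in one message is counted once.
    """
    edges = []
    for msg in messages:
        timestamp = datetime.fromisoformat(msg["time"])
        nodes = frozenset([msg["sender"], *msg["recipients"]])
        edges.append((timestamp, nodes))
    edges.sort(key=lambda edge: edge[0])
    return edges


hyperedges = to_hyperedges(MESSAGES)
for timestamp, nodes in hyperedges:
    print(timestamp.isoformat(), sorted(nodes))
```

Note that `datetime.fromisoformat` accepts only a subset of ISO 8601 in Python versions before 3.11, so timestamps in other ISO 8601 variants may need a different parser.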
This dataset was collected and prepared by the CALO Project (A Cognitive Assistant that Learns and Organizes). It contains data from about 150 users, mostly senior management of Enron, organized into folders. The corpus contains a total of about 0.5M messages. This data was originally made public and posted to the web by the Federal Energy Regulatory Commission during its investigation.
The email dataset was later purchased by Leslie Kaelbling at MIT and turned out to have a number of integrity problems. A number of folks at SRI, notably Melinda Gervasio, worked hard to correct these problems, and it is thanks to them that the dataset is available. The dataset here does not include attachments, and some messages have been deleted "as part of a redaction effort due to requests from affected employees". Invalid email addresses were converted to something of the form user@enron.com whenever possible (i.e., the recipient is specified in some parseable format like "Doe, John" or "Mary K. Smith") and to no_address@enron.com when no recipient was specified.
Some basic statistics of this dataset are:
Component Size, Number
Source: email-Enron dataset
If you use this dataset, please cite these references:
License: https://choosealicense.com/licenses/lgpl-3.0/
Phishing Email Dataset
This dataset on Hugging Face is a direct copy of the 'Phishing Email Detection' dataset from Kaggle, shared under the GNU Lesser General Public License 3.0. The dataset was originally created by the user 'Cyber Cop' on Kaggle. For complete details, including licensing and usage information, please visit the original Kaggle page.
License: https://cubig.ai/store/terms-of-service
1) Data Introduction
• The Email Phishing Dataset is designed for phishing email detection using machine learning.
2) Data Utilization
(1) The Email Phishing Dataset has the following characteristics:
• All emails were refined and subjected to a custom NLP feature extraction pipeline focused on phishing metrics.
• This dataset contains no raw text or headers, only features engineered for model training/testing.
(2) The Email Phishing Dataset can be used for:
• Developing an email detection model: It can be used to train and evaluate AI models that classify normal mail and phishing mail using various characteristics such as email body, subject, and sender.
• E-mail security policy and threat analysis research: Real phishing cases and normal email data can be analyzed to derive the characteristics of phishing attacks, which can then be used to establish effective email security policies and develop threat response strategies.
License: https://academictorrents.com/nolicensespecified
To quote the data source: "This dataset was collected and prepared by the CALO Project (A Cognitive Assistant that Learns and Organizes). It contains data from about 150 users, mostly senior management of Enron, organized into folders. The corpus contains a total of about 0.5M messages. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation. The email dataset was later purchased by Leslie Kaelbling at MIT, and turned out to have a number of integrity problems. A number of folks at SRI, notably Melinda Gervasio, worked hard to correct these problems, and it is thanks to them (not me) that the dataset is available. The dataset here does not include attachments, and some messages have been deleted "as part of a redaction effort due to requests from affected employees". Invalid email addresses were converted to something of the form user@enron.com whenever possible (i.e., recipient is specified in som
Bytemine offers access to over 100 million verified personal email addresses for US consumers and professionals. This extensive B2C contact database is designed to support modern outreach, digital marketing, lead generation, and customer engagement across channels that reach people where they are most responsive: their personal inbox.
Unlike traditional work email databases that limit outreach to business hours or corporate filters, personal emails enable more flexible, direct, and often higher-converting communication. Whether you're running direct-to-consumer campaigns, re-engaging inactive users, or enriching existing contact records, Bytemine provides the scale and data quality you need to connect effectively.
Our personal email dataset includes:
100 million+ verified personal email addresses (Gmail, Yahoo, Outlook, etc.)
Matched with names, phone numbers, location, and demographic attributes
50+ enriched fields including age range, gender, location, occupation, and consumer behavior signals
Optional inclusion of job title, company, and professional details for dual B2B-B2C targeting
All emails are verified and regularly updated to ensure deliverability, reduce bounce rates, and improve sender reputation. Contacts are sourced through direct data licensing agreements with consumer platforms, B2C applications, and verified aggregators, ensuring compliance and reliability.
This data is ideal for:
B2C marketing campaigns (email newsletters, promotions, lifecycle emails)
Direct-to-consumer product launches and brand activations
Customer re-engagement and loyalty campaigns
Lookalike audience creation for paid media
CRM enrichment with consumer-facing contact info
Identity resolution and cross-channel targeting
Data onboarding for ad platforms or audience segmentation
Consumer surveys, polling, and research
Bytemine’s personal email dataset empowers your marketing, growth, and data teams with clean, structured, and highly scalable contact information. Each record can be enriched with behavioral and demographic data, enabling advanced personalization and segmentation strategies.
Access is available through:
With flexible delivery options and scalable pricing, Bytemine supports startups, growth teams, agencies, and enterprise platforms looking to expand their reach and drive performance with verified consumer data.
If you're looking to power outreach across consumer inboxes, enrich B2C data, or build a scalable, compliant contact database, Bytemine’s personal email dataset is the fastest way to connect with real people across the United States.
Global Email Address & Contact Data Solutions: 293M+ Verified Emails and Phone Numbers for B2B & B2C Outreach
Boost your marketing and sales strategies with Forager.ai's Global Contact Data and Email address Data. Our comprehensive database offers access to over 293 million verified email addresses, along with phone number data and detailed B2B Email data and contact information. Whether you're focused on expanding your B2B Email outreach or improving lead generation, our solutions provide the tools you need to engage decision-makers and drive success.
Designed to support your Email data-driven marketing efforts, Forager.ai delivers valuable insights with email data, phone number data, and contact details for both B2B and B2C audiences. Build meaningful connections and leverage high-quality, verified Email data to execute precise and effective outreach strategies.
Core Features of Forager.ai B2B Email Data Solutions:
Targeted B2B Email Data: Gain access to a diverse collection of email addresses that help you execute personalized email campaigns targeting key decision-makers across industries.
Comprehensive Phone Number Data: Enhance your sales and telemarketing strategies with our extensive phone number database, perfect for direct outreach and boosting customer engagement.
B2B and B2C Contact Data: Tailor your messaging with B2B contact data and B2C contact Email address data that allow you to effectively connect with C-suite executives, decision-makers, and key consumer groups.
CEO Contact Information: Unlock direct access to CEO contact details, ideal for high-level networking, partnership building, and executive outreach.
Strategic Applications of Forager.ai Data:
Online Marketing & Campaigns: Utilize our email address data and phone number information to run targeted online marketing campaigns, increasing conversion rates and boosting outreach effectiveness.
Database Enrichment: Improve your sales databases and CRM systems by enriching them with accurate and up-to-date contact data, supporting more informed decision-making.
B2B Lead Generation: Tap into our rich B2B Email data to expand your business networks, refine your outreach efforts, and generate high-quality leads.
Sales Data Amplification: Supercharge your sales strategies by integrating enriched contact data for better targeting and higher sales conversion rates.
Competitive Market Intelligence: Gain valuable insights into your competitors by leveraging our comprehensive contact data to analyze trends and shifts in the market.
Why Forager.ai Stands Out:
Precision & Accuracy: With a 95%+ accuracy rate, Forager.ai ensures that your email data and contact information is always fresh, reliable, and ready to be used for maximum impact.
Global Reach, Local Relevance: Our Email address data solutions cover global markets while allowing you to focus on specific regions, industries, and audience segments tailored to your business needs.
Cost-Effective Solutions: We offer scalable, affordable B2B email data and B2B contact data packages, ensuring you get high-value results without breaking your budget.
Ethical, Compliant Data: We strictly adhere to GDPR guidelines, ensuring that all contact data is ethically sourced and legally compliant, protecting both your business and your customers.
Unlock the Power of Verified Email (Personal Email data & Business Email data) Contact Data with Forager.ai
Explore the potential of our 293M+ verified email addresses and phone numbers to elevate your B2B email marketing, sales outreach, and data-driven initiatives. Our contact data solutions are tailored to support your lead generation, sales pipeline, and competitive intelligence efforts, giving you the tools to execute more effective and impactful campaigns.
Top Use Cases for Forager.ai Data Solutions:
Lead Generation & B2B Prospecting
Cold B2B Email Outreach
CRM Enrichment & Marketing Automation
Account-Based Marketing (ABM)
Recruiting & Executive Search
Market Research & Competitive Intelligence
Flexible Data Licensing & Access Options:
One-Time Data Files available upon request
24/7 API Access for seamless integration
Monthly & Annual Plans tailored to your needs
API Credits Roll Over with no expiration
Reach out to us today to discover how Forager.ai's high-quality Email data and contact data can transform your outreach strategies and drive greater business success.
This is a collection of text data from 160 emails. For each email, we have included the subject, text, and email type. The four types of email included in the dataset are fraud, false positives (legitimate emails), phishing, and commercial spam, with 40 emails of each type. This type of data can be used to help build a more complex email spam blocker and could have applications in cybersecurity.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We have curated 7 repositories. The Ling and Enron datasets possess just two features: ‘Subject’ and ‘Body’. The other datasets consist of six features, namely ‘Sender’, ‘Receiver’, ‘Date’, ‘Subject’, ‘Body’, and ‘Urls’.
Please cite this dataset:
A. I. Champa, M. F. Rabbi, and M. F. Zibran, “Curated datasets and feature analysis for phishing email detection with machine learning,” in 3rd IEEE International Conference on Computing and Machine Intelligence (ICMI), 2024, pp. 1–7 (to appear).
or
@inproceedings{champa2024curated,
  title={Curated Datasets and Feature Analysis for Phishing Email Detection with Machine Learning},
  author={Champa, Arifa I and Rabbi, Md Fazle and Zibran, Minhaz F},
  booktitle={3rd IEEE International Conference on Computing and Machine Intelligence (ICMI)},
  pages={1--7 (to appear)},
  year={2024}
}
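Because the Ling and Enron datasets carry two features while the others carry six, combining them requires a common column layout. The sketch below shows one possible way to align records, filling the missing columns with None; the helper name `align` and all sample values are invented for illustration, and the real value types of columns such as ‘Urls’ are not specified in the description.

```python
# Column names as listed in the dataset description.
COLUMNS = ["Sender", "Receiver", "Date", "Subject", "Body", "Urls"]


def align(record):
    """Return a copy of record with all six columns, filling gaps with None."""
    return {column: record.get(column) for column in COLUMNS}


# Invented sample records: one in the two-feature layout, one in the
# six-feature layout. The values are placeholders, not real data.
two_feature = {"Subject": "Hello", "Body": "Short note."}
six_feature = {
    "Sender": "x@example.com",
    "Receiver": "y@example.com",
    "Date": "2008-08-05",
    "Subject": "Offer",
    "Body": "Click now.",
    "Urls": 1,
}

rows = [align(two_feature), align(six_feature)]
for row in rows:
    print(row)
```

Whether to keep the extra columns as missing values or to drop them and train on ‘Subject’ and ‘Body’ only is a modeling decision that the description leaves open.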
Andrews Wharton's Actionable US Consumer Email Database hosts over 650 million email addresses that have been active within the last 36 months. This database is fully CAN-SPAM compliant and 100% opted-in for Third Party Use.
This Email Address database connects you with your customers and/or prospects at their most recent, deliverable online address and increases impression rates, deliverability, and engagement in your digital campaigns.
The Email Address Data is 100% populated with email address, HEMs (MD5, SHA-1, SHA-256), first name, last name, postal address (primary and secondary), IP address, and time stamp(s) for last registration, verification, and first seen. An enhanced version of the database is available with date of birth (where available), phone (mobile and landline), and MAID-to-hashed-email conversion.
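HEMs are commonly understood as hashed emails. As a sketch of how the three digests listed above could be computed for one address, the example below uses Python's standard hashlib module. Trimming whitespace and lowercasing before hashing is an assumed convention, not something the vendor description states, and the helper name `hashed_emails` and the sample address are made up.

```python
import hashlib


def hashed_emails(address):
    """Return MD5, SHA-1, and SHA-256 hex digests of an email address."""
    # Assumption: normalize by trimming whitespace and lowercasing first,
    # so that differently typed copies of one address hash identically.
    normalized = address.strip().lower().encode("utf-8")
    return {
        "md5": hashlib.md5(normalized).hexdigest(),
        "sha1": hashlib.sha1(normalized).hexdigest(),
        "sha256": hashlib.sha256(normalized).hexdigest(),
    }


hems = hashed_emails("  Jane.Doe@Example.com ")
for name, digest in hems.items():
    print(name, digest)
```

Matching against a vendor file only works if both sides apply the same normalization, so the provider's convention should be confirmed before relying on this one.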
The Andrews Wharton Actionable US Consumer Email Database is updated monthly. A complete replacement database or new adds are available as update files.
Contact us at successdelivered@andrewswharton.com or visit us at www.andrewswharton.com to learn more about this dataset.
Weekly sample of reviewed emails with review score and reasons.
Discover unparalleled business opportunities with our Targeted Email List, featuring over 2 billion global contacts.
Explore our global B2B contact and company database, providing essential data fields including Name, Website, Contact First Name, Contact Last Name, Job Title, Email Address, Phone Number, Revenue Size, Employee Size, Location, City, State, Country, Zip Code, and additional customizable data fields upon request. Access a comprehensive repository tailored to meet your specific business needs, ensuring you have access to accurate and detailed information for effective networking and targeted outreach.
Success.ai offers a comprehensive, enterprise-ready B2B leads data solution, ideal for businesses seeking access to over 150 million verified employee profiles and 170 million work emails. Our data empowers organizations across industries to target key decision-makers, optimize recruitment, and fuel B2B marketing efforts. Whether you're looking for UK B2B data, B2B marketing data, or global B2B contact data, Success.ai provides the insights you need with pinpoint accuracy.
Tailored for B2B Sales, Marketing, Recruitment and more: Our B2B contact data and B2B email data solutions are designed to enhance your lead generation, sales, and recruitment efforts. Build hyper-targeted lists based on job title, industry, seniority, and geographic location. Whether you’re reaching mid-level professionals or C-suite executives, Success.ai delivers the data you need to connect with the right people.
API Features:
Key Categories Served:
B2B sales leads – Identify decision-makers in key industries
B2B marketing data – Target professionals for your marketing campaigns
Recruitment data – Source top talent efficiently and reduce hiring times
CRM enrichment – Update and enhance your CRM with verified, updated data
Global reach – Coverage across 195 countries, including the United States, United Kingdom, Germany, India, Singapore, and more
Global Coverage with Real-Time Accuracy: Success.ai’s dataset spans a wide range of industries such as technology, finance, healthcare, and manufacturing. With continuous real-time updates, your team can rely on the most accurate data available:
150M+ Employee Profiles: Access professional profiles worldwide with insights including full name, job title, seniority, and industry.
170M Verified Work Emails: Reach decision-makers directly with verified work emails, available across industries and geographies, including Singapore and UK B2B data.
GDPR-Compliant: Our data is fully compliant with GDPR and other global privacy regulations, ensuring safe and legal use of B2B marketing data.
Key Data Points for Every Employee Profile: Every profile in Success.ai’s database includes over 20 critical data points, providing the information needed to power B2B sales and marketing campaigns: Full Name, Job Title, Company, Work Email, Location, Phone Number, LinkedIn Profile, Experience, Education, Technographic Data, Languages, Certifications, Industry, Publications & Awards.
Use Cases Across Industries: Success.ai’s B2B data solution is incredibly versatile and can support various enterprise use cases, including:
B2B Marketing Campaigns: Reach high-value professionals in industries such as technology, finance, and healthcare.
Enterprise Sales Outreach: Build targeted B2B contact lists to improve sales efforts and increase conversions.
Talent Acquisition: Accelerate hiring by sourcing top talent with accurate and updated employee data, filtered by job title, industry, and location.
Market Research: Gain insights into employment trends and company profiles to enrich market research.
CRM Data Enrichment: Ensure your CRM stays accurate by integrating updated B2B contact data.
Event Targeting: Create lists for webinars, conferences, and product launches by targeting professionals in key industries.
Use Cases for Success.ai's Contact Data
- Targeted B2B Marketing: Create precise campaigns by targeting key professionals in industries like tech and finance.
- Sales Outreach: Build focused sales lists of decision-makers and C-suite executives for faster deal cycles.
- Recruiting Top Talent: Easily find and hire qualified professionals with updated employee profiles.
- CRM Enrichment: Keep your CRM current with verified, accurate employee data.
- Event Targeting: Create attendee lists for events by targeting relevant professionals in key sectors.
- Market Research: Gain insights into employment trends and company profiles for better business decisions.
- Executive Search: Source senior executives and leaders for headhunting and recruitment.
- Partnership Building: Find the right companies and key people to develop strategic partnerships.
Why Choose Success.ai’s Employee Data? Success.ai is the top choice for enterprises looking for comprehensive and affordable B2B data solutions. Here’s why: Unmatched Accuracy: Our AI-powered validation process ensures 99% accuracy across all data points, resulting in higher engagement and fewer bounces. Global Scale: With 150M+ employee profiles and 170M veri...
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0): https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
The dataset consists of a collection of emails categorized into two major classes: spam and not spam. It is designed to facilitate the development and evaluation of spam detection or email filtering systems.
The spam emails in the dataset are typically unsolicited and unwanted messages that aim to promote products or services, spread malware, or deceive recipients for various malicious purposes. These emails often contain misleading subject lines, excessive use of advertisements, unauthorized links, or attempts to collect personal information.
The non-spam emails in the dataset are genuine and legitimate messages sent by individuals or organizations. They may include personal or professional communication, newsletters, transaction receipts, or any other non-malicious content.
The dataset encompasses emails of varying lengths, languages, and writing styles, reflecting the inherent heterogeneity of email communication. This diversity aids in training algorithms that can generalize well to different types of emails, making them robust against different spammer tactics and variations in non-spam email content.
includes the following information:
keywords: spam mails dataset, email spam classification, spam or not-spam, spam e-mail database, spam detection system, email spamming data set, spam filtering system, spambase, feature extraction, spam ham email dataset, classifier, machine learning algorithms, cybersecurity, text dataset, sentiment analysis, llm dataset, language modeling, large language models, text classification, text mining dataset, natural language texts, nlp, nlp open-source dataset, text data
Good data is crucial for any business or organization that wants to grow its network, because all relevant details about companies and users are stored in the database. Companies have benefited from using our email database to extract their prospects' details.
It is a well-known fact that LinkedIn gives you the opportunity to expand your business network. You can easily connect with your prospects, directly or through mutual connections, by using search keywords related to their name, company, profile, address, etc. However, as a leading data provider, we spare you that work. Our professionals' email database contains all the necessary business information about your prospects, and there are several ways to access it (especially email addresses and phone numbers).
With our service, you can reach over 69 million records in 200+ countries. Our database is well organized and keeps information easily accessible. Easily increase your sales with reliable LinkedIn data that connects you directly to your goal. We have worked hard to supply quality, reliable, sustainable email databases.
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Use Vietnam email data to connect with prominent professionals, increase your sales, and grow your market presence. Tap into the growing Vietnamese market with our comprehensive Vietnam email data. This premium list provides access to engaged consumers and businesses. As a result, you can broaden your customer base and increase sales. Moreover, our data is carefully compiled and validated. Therefore, you can ensure high deliverability and engagement. Consequently, you can tailor your marketing messages for maximum impact. Furthermore, this valuable resource enables you to build lasting relationships. Finally, List to Data offers this targeted dataset to help you succeed in Vietnam.
The Vietnam consumer email list empowers you to build valuable relationships with potential customers, fostering brand loyalty and driving repeat business. Access the Vietnamese market with our premium Vietnam consumer email list. This comprehensive resource provides access to a vast network of potential customers. As a result, you can increase your brand visibility and drive sales. Moreover, our data is regularly updated and verified. Therefore, you can improve your marketing ROI. Consequently, you can target specific demographics and regions. Furthermore, this valuable resource allows you to connect with key decision-makers. Finally, List to Data offers this powerful dataset to fuel your business growth in Vietnam.
The Vietnam business email list is a powerful resource for reaching professionals in Vietnam. This database provides verified leads to ensure your campaigns are effective. Additionally, it is designed to save time and maximize ROI. Moreover, the directory is regularly updated for accuracy. Furthermore, it offers a seamless way to expand your market reach. As a result, you can enhance your marketing efforts with reliable information. In addition, this library of contacts is tailored for both B2B and B2C outreach. Finally, trust List to Data to deliver a dataset that drives results and boosts your market presence.
Ensure the success of your email campaigns with Success.ai’s Email Address Data API. Connect with over 700 million professionals globally, accessing verified email addresses. This API supports real-time data updates, guaranteeing high deliverability and engagement rates for your outreach efforts.
Salutary Data is a boutique B2B contact and company data provider that's committed to delivering high-quality data for sales intelligence, lead generation, marketing, recruiting/HR, identity resolution, and ML/AI. Our database currently consists of 148MM+ highly curated B2B contacts (US only), along with 4M+ companies, and is updated regularly to ensure we have the most up-to-date information.
We can enrich your in-house data (CRM enrichment, lead enrichment, etc.) and provide you with a custom dataset (such as a lead list) tailored to your target audience specifications and data use case. We also support large-scale data licensing to software providers and agencies that intend to redistribute our data to their customers and end users.
What makes Salutary unique?
- We offer our clients a truly unique, one-stop aggregation of best-of-breed data sources. Our supplier network consists of numerous established, high-quality suppliers that are rigorously vetted.
- We leverage third-party verification vendors to ensure phone numbers and emails are accurate and connect to the right person. Additionally, we deploy automated and manual verification techniques to ensure we have the latest job information for contacts.
- We're reasonably priced and easy to work with.
Products: API Suite, Web UI, Full and Custom Data Feeds
Services:
Data Enrichment - We assess the fill rate gaps and profile your customer file for the purpose of appending fields, updating information, and/or rendering net new “look alike” prospects for your campaigns.
ABM Match & Append - Send us your domain or other company-related files, and we’ll match your Account Based Marketing targets and provide you with B2B contacts to campaign. Optionally throw in your suppression file to avoid any redundant records.
Verification (“Cleaning/Hygiene”) Services - Address the 2% per month aging issue on contact records! We will identify duplicate records and contacts no longer at the company, remove your email hard bounces, and update or replace titles and phones. This is right up our alley and leverages our existing internal and external processes and systems.