3 datasets found
  1. Enron Email Dataset

    • academictorrents.com
    bittorrent
    Updated Aug 26, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Enron (2016). Enron Email Dataset [Dataset]. https://academictorrents.com/details/4697a6e1e7841602651b087d84f904d43590d4ff
    Explore at:
    bittorrent(443254787)Available download formats
    Dataset updated
    Aug 26, 2016
    Dataset authored and provided by
    Enronhttp://www.enron.com/
    License

    https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified

    Description

    To quote the data source: "This dataset was collected and prepared by the CALO Project (A Cognitive Assistant that Learns and Organizes). It contains data from about 150 users, mostly senior management of Enron, organized into folders. The corpus contains a total of about 0.5M messages. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation. The email dataset was later purchased by Leslie Kaelbling at MIT, and turned out to have a number of integrity problems. A number of folks at SRI, notably Melinda Gervasio, worked hard to correct these problems, and it is thanks to them (not me) that the dataset is available. The dataset here does not include attachments, and some messages have been deleted "as part of a redaction effort due to requests from affected employees". Invalid email addresses were converted to something of the form user@enron.com whenever possible (i.e., recipient is specified in som

  2. w

    Enron Email Dataset

    • data.wu.ac.at
    gz
    Updated Oct 10, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Global (2013). Enron Email Dataset [Dataset]. https://data.wu.ac.at/odso/datahub_io/OTE3MTliODMtNGEyZi00OTQ0LTgzYTQtNmJiZTgwMDg4NGJi
    Explore at:
    gzAvailable download formats
    Dataset updated
    Oct 10, 2013
    Dataset provided by
    Global
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    About

    From distribution page:

    This dataset was collected and prepared by the CALO Project (A Cognitive Assistant that Learns and Organizes). It contains data from about 150 users, mostly senior management of Enron, organized into folders. The corpus contains a total of about 0.5M messages. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation.

    The email dataset was later purchased by Leslie Kaelbling at MIT, and turned out to have a number of integrity problems. A number of folks at SRI, notably Melinda Gervasio, worked hard to correct these problems, and it is thanks to them (not me) that the dataset is available. The dataset here does not include attachments, and some messages have been deleted "as part of a redaction effort due to requests from affected employees". Invalid email addresses were converted to something of the form user@enron.com whenever possible (i.e., recipient is specified in some parse-able format like "Doe, John" or "Mary K. Smith") and to no_address@enron.com when no recipient was specified.

    I get a number of questions about this corpus each week, which I am unable to answer, mostly because they deal with preparation issues and such that I just don't know about. If you ask me a question and I don't answer, please don't feel slighted.

    I am distributing this dataset as a resource for researchers who are interested in improving current email tools, or understanding how email is currently used. This data is valuable; to my knowledge it is the only substantial collection of "real" email that is public. The reason other datasets are not public is because of privacy concerns. In using this dataset, please be sensitive to the privacy of the people involved (and remember that many of these people were certainly not involved in any of the actions which precipitated the investigation.)

    Downloads

    Download is "about 400Mb, tarred and gzipped".

    Openness

    Unknown.

  3. O

    Enron Email Dataset

    • opendatalab.com
    zip
    Updated Aug 28, 2004
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massachusetts Institute of Technology (2004). Enron Email Dataset [Dataset]. https://opendatalab.com/OpenDataLab/Enron_Email_Dataset
    Explore at:
    zip(1421183736 bytes)Available download formats
    Dataset updated
    Aug 28, 2004
    Dataset provided by
    Massachusetts Institute of Technology
    Description

    This dataset was collected and prepared by the CALO Project (A Cognitive Assistant that Learns and Organizes). It contains data from about 150 users, mostly senior management of Enron, organized into folders. The corpus contains a total of about 0.5M messages. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation.

  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Enron (2016). Enron Email Dataset [Dataset]. https://academictorrents.com/details/4697a6e1e7841602651b087d84f904d43590d4ff
Organization logo

Enron Email Dataset

Explore at:
bittorrent(443254787)Available download formats
Dataset updated
Aug 26, 2016
Dataset authored and provided by
Enronhttp://www.enron.com/
License

https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified

Description

To quote the data source: "This dataset was collected and prepared by the CALO Project (A Cognitive Assistant that Learns and Organizes). It contains data from about 150 users, mostly senior management of Enron, organized into folders. The corpus contains a total of about 0.5M messages. This data was originally made public, and posted to the web, by the Federal Energy Regulatory Commission during its investigation. The email dataset was later purchased by Leslie Kaelbling at MIT, and turned out to have a number of integrity problems. A number of folks at SRI, notably Melinda Gervasio, worked hard to correct these problems, and it is thanks to them (not me) that the dataset is available. The dataset here does not include attachments, and some messages have been deleted "as part of a redaction effort due to requests from affected employees". Invalid email addresses were converted to something of the form user@enron.com whenever possible (i.e., recipient is specified in som

Search
Clear search
Close search
Google apps
Main menu