18 datasets found
  1. h

    Data from: dataset-1

    • huggingface.co
    Updated Sep 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    김정석 (2024). dataset-1 [Dataset]. https://huggingface.co/datasets/privetin/dataset-1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 20, 2024
    Authors
    김정석
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Custom Text Dataset

      Dataset Name
    

    Custom CNN/Daily Mail Summarization Dataset

      Overview
    

    This dataset is a custom version of the CNN/Daily Mail dataset, designed for text summarization tasks. It contains news articles and their corresponding summaries.

      Composition
    

    The dataset consists of two splits:

    Train: 1 custom example Test: 100 examples from the original CNN/Daily Mail dataset

    Each example contains:

    'sentence': The full text of… See the full description on the dataset page: https://huggingface.co/datasets/privetin/dataset-1.

  2. h

    cnn-dailymail-summaries

    • huggingface.co
    Updated Aug 1, 2007
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Argilla (2007). cnn-dailymail-summaries [Dataset]. https://huggingface.co/datasets/argilla/cnn-dailymail-summaries
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 1, 2007
    Dataset authored and provided by
    Argilla
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Card for cnn-dailymail-summaries

    This dataset has been created with distilabel. The pipeline script was uploaded to easily reproduce the dataset: cnn_daily_summaries.py. It can be run directly using the CLI: distilabel pipeline run --script "https://huggingface.co/datasets/argilla/cnn-dailymail-summaries/raw/main/cnn_daily_summaries.py"

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that… See the full description on the dataset page: https://huggingface.co/datasets/argilla/cnn-dailymail-summaries.

  3. v

    Daily Toys Gifts Co Ltd Company profile with phone,email, buyers, suppliers,...

    • volza.com
    csv
    Updated Sep 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Volza FZ LLC (2025). Daily Toys Gifts Co Ltd Company profile with phone,email, buyers, suppliers, price, export import shipments. [Dataset]. https://www.volza.com/company-profile/daily-toys-gifts-co-ltd-8256721
    Explore at:
    csvAvailable download formats
    Dataset updated
    Sep 16, 2025
    Dataset authored and provided by
    Volza FZ LLC
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2014 - Sep 30, 2021
    Variables measured
    Count of exporters, Count of importers, Sum of export value, Sum of import value, Count of export shipments, Count of import shipments
    Description

    Credit report of Daily Toys Gifts Co Ltd contains unique and detailed export import market intelligence with it's phone, email, Linkedin and details of each import and export shipment like product, quantity, price, buyer, supplier names, country and date of shipment.

  4. h

    CNN-Daily-Mail-Sinhala

    • huggingface.co
    Updated Jul 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hamza Ziyard (2023). CNN-Daily-Mail-Sinhala [Dataset]. https://huggingface.co/datasets/Hamza-Ziyard/CNN-Daily-Mail-Sinhala
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 7, 2023
    Authors
    Hamza Ziyard
    Description

    Dataset Summary

    This dataset card aims to be creating a new dataset or Sinhala news summarization tasks. It has been generated using [https://huggingface.co/datasets/cnn_dailymail] and google translate.

      Data Instances
    

    For each instance, there is a string for the article, a string for the highlights, and a string for the id. See the CNN / Daily Mail dataset viewer to explore more examples. {'id': '0054d6d30dbcad772e20b22771153a2a9cbeaf62', 'article': '(CNN) -- An American… See the full description on the dataset page: https://huggingface.co/datasets/Hamza-Ziyard/CNN-Daily-Mail-Sinhala.

  5. h

    CNNdaily-Alpca-finetune1500

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mudit, CNNdaily-Alpca-finetune1500 [Dataset]. https://huggingface.co/datasets/gaumudit/CNNdaily-Alpca-finetune1500
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Mudit
    Description

    gaumudit/CNNdaily-Alpca-finetune1500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. Data from: daily data

    • kaggle.com
    Updated Aug 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bowZZZ (2024). daily data [Dataset]. https://www.kaggle.com/datasets/bowzzz/daily-data/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 30, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    bowZZZ
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by bowZZZ

    Released under Apache 2.0

    Contents

  7. h

    ai-vs-human-HuggingFaceTB-SmolLM2-1.7B-Instruct

    • huggingface.co
    Updated Feb 28, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    camil (2014). ai-vs-human-HuggingFaceTB-SmolLM2-1.7B-Instruct [Dataset]. https://huggingface.co/datasets/zcamz/ai-vs-human-HuggingFaceTB-SmolLM2-1.7B-Instruct
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 28, 2014
    Authors
    camil
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    AI vs Human dataset on the CNN Daily mails

      Dataset Description
    

    This dataset showcases pairs of truncated articles and their respective completions, crafted either by humans or an AI language model. Each article was randomly truncated between 25% and 50% of its length. The language model was then tasked with generating a completion that mirrored the characters count of the original human-written continuation.

      Data Fields
    

    'human': The original human-authored… See the full description on the dataset page: https://huggingface.co/datasets/zcamz/ai-vs-human-HuggingFaceTB-SmolLM2-1.7B-Instruct.

  8. w

    Daily-news-tribune (Company) - Reverse Whois Lookup

    • whoisdatacenter.com
    csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc, Daily-news-tribune (Company) - Reverse Whois Lookup [Dataset]. https://whoisdatacenter.com/company/daily-news-tribune/
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Aug 22, 2025
    Description

    Uncover historical ownership history and changes over time by performing a reverse Whois lookup for the company daily-news-tribune.

  9. h

    All-Daily-News

    • huggingface.co
    Updated Sep 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Papers With Backtest (2024). All-Daily-News [Dataset]. https://huggingface.co/datasets/paperswithbacktest/All-Daily-News
    Explore at:
    Dataset updated
    Sep 3, 2024
    Dataset authored and provided by
    Papers With Backtest
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Information

    This dataset includes news data for various instruments.

      Instruments Included
    

    Stocks, ETFs, Forex, Cryptocurrencies, Commodities and more.

      Dataset Columns
    

    symbols: The symbols in the news, typically representing stock tickers or other financial instruments mentioned in the article. datetime: The date and time when the news article was published, formatted as a string. title: The title of the news article, providing a brief and descriptive… See the full description on the dataset page: https://huggingface.co/datasets/paperswithbacktest/All-Daily-News.

  10. w

    free-daily-news.com - Historical whois Lookup

    • whoisdatacenter.com
    csv
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc, free-daily-news.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/free-daily-news.com/
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Sep 16, 2025
    Description

    Explore the historical Whois records related to free-daily-news.com (Domain). Get insights into ownership history and changes over time.

  11. Daily online reach of mass media company Condé Nast in Italy 2019, by device...

    • statista.com
    Updated Jul 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Daily online reach of mass media company Condé Nast in Italy 2019, by device [Dataset]. https://www.statista.com/statistics/575264/conde-nast-online-reach-by-platform-in-italy/
    Explore at:
    Dataset updated
    Jul 10, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jan 2019 - Dec 2019
    Area covered
    Italy
    Description

    This statistic shows the unique daily internet audience of the mass media company Condé Nast in Italy from January to December 2019. In July 2019, Condé Nast reached over *** thousand individuals via smartphone every day. As of December of the same year, over *** thousand people accessed the website through their smartphones.

  12. h

    Stocks-Daily-Price

    • huggingface.co
    Updated Jul 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Papers With Backtest (2024). Stocks-Daily-Price [Dataset]. https://huggingface.co/datasets/paperswithbacktest/Stocks-Daily-Price
    Explore at:
    Dataset updated
    Jul 2, 2024
    Dataset authored and provided by
    Papers With Backtest
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Information

    This dataset includes daily price data for various stocks.

      Instruments Included
    

    7000+ US Stocks

      Dataset Columns
    

    symbol: The symbol of the stock. date: The date of the data. open: The opening price of the stock. high: The highest price of the stock. low: The lowest price of the stock. close: The closing price of the stock. volume: The volume of the stock. adj_close: The adjusted closing price of the stock.

      Data Splits
    

    The… See the full description on the dataset page: https://huggingface.co/datasets/paperswithbacktest/Stocks-Daily-Price.

  13. h

    csl-daily-sentence-crop

    • huggingface.co
    Updated Jun 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    zecheng (2025). csl-daily-sentence-crop [Dataset]. https://huggingface.co/datasets/ZechengLi19/csl-daily-sentence-crop
    Explore at:
    Dataset updated
    Jun 6, 2025
    Authors
    zecheng
    Description

    ZechengLi19/csl-daily-sentence-crop dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. h

    Commodities-Daily-Price

    • huggingface.co
    Updated Jun 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Papers With Backtest (2024). Commodities-Daily-Price [Dataset]. https://huggingface.co/datasets/paperswithbacktest/Commodities-Daily-Price
    Explore at:
    Dataset updated
    Jun 16, 2024
    Dataset authored and provided by
    Papers With Backtest
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Information

    This dataset includes daily price data for various commodities.

      Instruments Included
    

    BDIY: Baltic Dry Index BEEF: Beef (dollars per pound) BIT: Bitumen (dollars per metric ton) C1: Corn (dollars per bushel) CC1: Cocoa (dollars per metric ton) CHE: Cheese (dollars per pound) CL1: Crude Oil (dollars per barrel) CO1: Brent Crude Oil (dollars per barrel) CRYTR: CRB Index CT1: Cotton (cents per pound) DA: Milk (dollars per hundredweight) DL1: Ethanol… See the full description on the dataset page: https://huggingface.co/datasets/paperswithbacktest/Commodities-Daily-Price.

  15. h

    daily-activity-food

    • huggingface.co
    Updated Mar 4, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ayush (2025). daily-activity-food [Dataset]. https://huggingface.co/datasets/ayushsinghce/daily-activity-food
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 4, 2025
    Authors
    Ayush
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    ayushsinghce/daily-activity-food dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    incivility-arizona-daily-star-comments

    • huggingface.co
    Updated Mar 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Civility Lab (2023). incivility-arizona-daily-star-comments [Dataset]. https://huggingface.co/datasets/civility-lab/incivility-arizona-daily-star-comments
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 2, 2023
    Dataset authored and provided by
    Civility Lab
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for incivility-arizona-daily-star-comments

    This is a collection of more than 6000 comments on Arizona Daily Star news articles from 2011 that have been manually annotated for various forms of incivility including aspersion, namecalling, sarcasm, and vulgarity.

      Dataset Structure
    

    Each instance in the dataset corresponds to a single comment from a single commenter. An instance's text field contains the text of the comment with any quotes of other commenters… See the full description on the dataset page: https://huggingface.co/datasets/civility-lab/incivility-arizona-daily-star-comments.

  17. h

    CHIRPS-2.0-Global-Daily-p05

    • huggingface.co
    Updated Aug 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mukesh Kumar (2025). CHIRPS-2.0-Global-Daily-p05 [Dataset]. https://huggingface.co/datasets/devsofmukesh/CHIRPS-2.0-Global-Daily-p05
    Explore at:
    Dataset updated
    Aug 11, 2025
    Authors
    Mukesh Kumar
    Description

    devsofmukesh/CHIRPS-2.0-Global-Daily-p05 dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. h

    daily-mobility-generation-benchmark

    • huggingface.co
    Updated Jul 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FIBLAB, Tsinghua University (2025). daily-mobility-generation-benchmark [Dataset]. https://huggingface.co/datasets/tsinghua-fib-lab/daily-mobility-generation-benchmark
    Explore at:
    Dataset updated
    Jul 8, 2025
    Dataset authored and provided by
    FIBLAB, Tsinghua University
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    tsinghua-fib-lab/daily-mobility-generation-benchmark dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
김정석 (2024). dataset-1 [Dataset]. https://huggingface.co/datasets/privetin/dataset-1

Data from: dataset-1

privetin/dataset-1

Custom CNN/Daily Mail Summarization Dataset

Related Article
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 20, 2024
Authors
김정석
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Dataset Card for Custom Text Dataset

  Dataset Name

Custom CNN/Daily Mail Summarization Dataset

  Overview

This dataset is a custom version of the CNN/Daily Mail dataset, designed for text summarization tasks. It contains news articles and their corresponding summaries.

  Composition

The dataset consists of two splits:

Train: 1 custom example Test: 100 examples from the original CNN/Daily Mail dataset

Each example contains:

'sentence': The full text of… See the full description on the dataset page: https://huggingface.co/datasets/privetin/dataset-1.

Search
Clear search
Close search
Google apps
Main menu