14 datasets found
  1. h

    ag_news

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wangrongsheng, ag_news [Dataset]. https://huggingface.co/datasets/wangrongsheng/ag_news
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    wangrongsheng
    License

    https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/

    Description

    Dataset Card for "ag_news"

      Dataset Summary
    

    AG is a collection of more than 1 million news articles. News articles have been gathered from more than 2000 news sources by ComeToMyHead in more than 1 year of activity. ComeToMyHead is an academic news search engine which has been running since July, 2004. The dataset is provided by the academic comunity for research purposes in data mining (clustering, classification, etc), information retrieval (ranking, search, etc)… See the full description on the dataset page: https://huggingface.co/datasets/wangrongsheng/ag_news.

  2. h

    ag_news

    • huggingface.co
    Updated Dec 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SetFit (2024). ag_news [Dataset]. https://huggingface.co/datasets/SetFit/ag_news
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 5, 2024
    Dataset authored and provided by
    SetFit
    Description

    SetFit/ag_news dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. T

    ag_news_subset

    • tensorflow.org
    Updated Dec 6, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). ag_news_subset [Dataset]. http://identifiers.org/arxiv:1509.01626
    Explore at:
    Dataset updated
    Dec 6, 2022
    Description

    AG is a collection of more than 1 million news articles. News articles have been gathered from more than 2000 news sources by ComeToMyHead in more than 1 year of activity. ComeToMyHead is an academic news search engine which has been running since July, 2004. The dataset is provided by the academic comunity for research purposes in data mining (clustering, classification, etc), information retrieval (ranking, search, etc), xml, data compression, data streaming, and any other non-commercial activity. For more information, please refer to the link http://www.di.unipi.it/~gulli/AG_corpus_of_news_articles.html .

    The AG's news topic classification dataset is constructed by Xiang Zhang (xiang.zhang@nyu.edu) from the dataset above. It is used as a text classification benchmark in the following paper: Xiang Zhang, Junbo Zhao, Yann LeCun. Character-level Convolutional Networks for Text Classification. Advances in Neural Information Processing Systems 28 (NIPS 2015).

    The AG's news topic classification dataset is constructed by choosing 4 largest classes from the original corpus. Each class contains 30,000 training samples and 1,900 testing samples. The total number of training samples is 120,000 and testing 7,600.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('ag_news_subset', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  4. h

    ag_news

    • huggingface.co
    Updated Oct 17, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ankit (2023). ag_news [Dataset]. https://huggingface.co/datasets/Ankit1057/ag_news
    Explore at:
    Dataset updated
    Oct 17, 2023
    Authors
    Ankit
    Description

    Ankit1057/ag_news dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    ag_news

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ag_news [Dataset]. https://huggingface.co/datasets/contemmcm/ag_news
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Marcio Monteiro
    Description

    contemmcm/ag_news dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. d

    Pretrained sentence BERT models AG News Results

    • data.dtu.dk
    txt
    Updated Jul 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Beatrix Miranda Ginn Nielsen (2024). Pretrained sentence BERT models AG News Results [Dataset]. http://doi.org/10.11583/DTU.21276648.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 26, 2024
    Dataset provided by
    Technical University of Denmark
    Authors
    Beatrix Miranda Ginn Nielsen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Raw result files used for tables and figures in Hubness Reduction Improves Sentence-BERT Semantic Spaces (DOI: coming)

    For more info see: https://github.com/bemigini/hubness-reduction-sentence-bert

  7. d

    sts_bert_microsoft-mpnet-base AG News Results

    • data.dtu.dk
    txt
    Updated Jul 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Beatrix Miranda Ginn Nielsen (2024). sts_bert_microsoft-mpnet-base AG News Results [Dataset]. http://doi.org/10.11583/DTU.21268422.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 26, 2024
    Dataset provided by
    Technical University of Denmark
    Authors
    Beatrix Miranda Ginn Nielsen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Raw result files used for tables and figures in Hubness Reduction Improves Sentence-BERT Semantic Spaces (DOI: coming)

    For more info see: https://github.com/bemigini/hubness-reduction-sentence-bert

  8. d

    sts_bert_distilroberta-base AG News results

    • data.dtu.dk
    json
    Updated Jul 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Beatrix Miranda Ginn Nielsen (2024). sts_bert_distilroberta-base AG News results [Dataset]. http://doi.org/10.11583/DTU.21387282.v1
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Jul 26, 2024
    Dataset provided by
    Technical University of Denmark
    Authors
    Beatrix Miranda Ginn Nielsen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Raw result files used for tables and figures in Hubness Reduction Improves Sentence-BERT Semantic Spaces (DOI: coming)

    For more info see: https://github.com/bemigini/hubness-reduction-sentence-bert

  9. h

    autoeval-eval-ag_news-default-5b1609-64790145529

    • huggingface.co
    Updated Apr 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Evaluation on the Hub (2024). autoeval-eval-ag_news-default-5b1609-64790145529 [Dataset]. https://huggingface.co/datasets/autoevaluate/autoeval-eval-ag_news-default-5b1609-64790145529
    Explore at:
    Dataset updated
    Apr 22, 2024
    Dataset authored and provided by
    Evaluation on the Hub
    Description

    autoevaluate/autoeval-eval-ag_news-default-5b1609-64790145529 dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. Processed AG News

    • kaggle.com
    zip
    Updated Nov 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pooja_S (2021). Processed AG News [Dataset]. https://www.kaggle.com/pooja987/processed-ag-news
    Explore at:
    zip(14466533 bytes)Available download formats
    Dataset updated
    Nov 28, 2021
    Authors
    Pooja_S
    Description

    Dataset

    This dataset was created by Pooja_S

    Contents

  11. h

    autoeval-eval-ag_news-default-8f9ba7-59715145371

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Evaluation on the Hub, autoeval-eval-ag_news-default-8f9ba7-59715145371 [Dataset]. https://huggingface.co/datasets/autoevaluate/autoeval-eval-ag_news-default-8f9ba7-59715145371
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset authored and provided by
    Evaluation on the Hub
    Description

    Dataset Card for AutoTrain Evaluator

    This repository contains model predictions generated by AutoTrain for the following task and dataset:

    Task: Summarization Model: AleBurzio/long-t5-base-govreport Dataset: ag_news Config: default Split: test

    To run new evaluation jobs, visit Hugging Face's automatic model evaluator.

      Contributions
    

    Thanks to @AdinaY for evaluating this model.

  12. h

    autoeval-staging-eval-project-ag_news-22fb867e-11605544

    • huggingface.co
    Updated Jul 25, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Evaluation on the Hub (2023). autoeval-staging-eval-project-ag_news-22fb867e-11605544 [Dataset]. https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-ag_news-22fb867e-11605544
    Explore at:
    Dataset updated
    Jul 25, 2023
    Dataset authored and provided by
    Evaluation on the Hub
    Description

    autoevaluate/autoeval-staging-eval-project-ag_news-22fb867e-11605544 dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    sft-dataset-v1.5

    • huggingface.co
    Updated Nov 27, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sft-dataset-v1.5 [Dataset]. https://huggingface.co/datasets/miya-99999/sft-dataset-v1.5
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 27, 2024
    Authors
    miya
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    This dataset was created by GPT-4o and other public datasets. Therefore, we follow the OpenAI API terms of use and license for each dataset. public datasets

    abisee/cnn_dailymail fancyzhx/ag_news JulesBelveze/tldr_news HuggingFaceH4/instruction-dataset

  14. h

    autoeval-staging-eval-project-1c7ef613-7224756

    • huggingface.co
    Updated Nov 29, 2004
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Evaluation on the Hub (2004). autoeval-staging-eval-project-1c7ef613-7224756 [Dataset]. https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-1c7ef613-7224756
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 29, 2004
    Dataset authored and provided by
    Evaluation on the Hub
    Description

    Dataset Card for AutoTrain Evaluator

    This repository contains model predictions generated by AutoTrain for the following task and dataset:

    Task: Multi-class Text Classification Model: nateraw/bert-base-uncased-ag-news Dataset: ag_news

    To run new evaluation jobs, visit Hugging Face's automatic evaluation service.

      Contributions
    

    Thanks to @abhishek for evaluating this model.

  15. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
wangrongsheng, ag_news [Dataset]. https://huggingface.co/datasets/wangrongsheng/ag_news

ag_news

AG’s News Corpus

wangrongsheng/ag_news

Explore at:
301 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
wangrongsheng
License

https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/

Description

Dataset Card for "ag_news"

  Dataset Summary

AG is a collection of more than 1 million news articles. News articles have been gathered from more than 2000 news sources by ComeToMyHead in more than 1 year of activity. ComeToMyHead is an academic news search engine which has been running since July, 2004. The dataset is provided by the academic comunity for research purposes in data mining (clustering, classification, etc), information retrieval (ranking, search, etc)… See the full description on the dataset page: https://huggingface.co/datasets/wangrongsheng/ag_news.

Search
Clear search
Close search
Google apps
Main menu