100+ datasets found
  1. Video-Sequence-Labeling

    • huggingface.co
    Updated Oct 6, 2024
    Cite
    V (2024). Video-Sequence-Labeling [Dataset]. https://huggingface.co/datasets/2nzi/Video-Sequence-Labeling
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Oct 6, 2024
    Authors
    V
    Description

    2nzi/Video-Sequence-Labeling dataset hosted on Hugging Face and contributed by the HF Datasets community

  2. expresso-tagged

    • huggingface.co
    Updated May 1, 2024
    + more versions
    Cite
    Yoach Lacombe (2024). expresso-tagged [Dataset]. https://huggingface.co/datasets/ylacombe/expresso-tagged
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    May 1, 2024
    Authors
    Yoach Lacombe
    Description

    ylacombe/expresso-tagged dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. top_tagging

    • huggingface.co
    Updated Jun 14, 2022
    Cite
    Deep Learning for Particle Physicists (2022). top_tagging [Dataset]. https://huggingface.co/datasets/dl4phys/top_tagging
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Jun 14, 2022
    Dataset authored and provided by
    Deep Learning for Particle Physicists
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Top Quark Tagging

      Dataset Summary
    

    Top Quark Tagging is a dataset of Monte Carlo simulated events produced by proton-proton collisions at the Large Hadron Collider. The top-quark signal and mixed quark-gluon background jets are produced with Pythia8 with its default tune for a center-of-mass energy of 14 TeV. Multiple interactions and pile-up are ignored. The leading 200 jet constituent four-momenta (E, p_x, p_y, p_z) are stored… See the full description on the dataset page: https://huggingface.co/datasets/dl4phys/top_tagging.

  4. Title-Tagging

    • huggingface.co
    Updated Apr 8, 2020
    Cite
    Parichha (2020). Title-Tagging [Dataset]. https://huggingface.co/datasets/Deependra/Title-Tagging
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Apr 8, 2020
    Authors
    Parichha
    Description

    Deependra/Title-Tagging dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. SLT-Task2-Post-ASR-Speaker-Tagging

    • huggingface.co
    Updated Jun 22, 2024
    Cite
    ASR-LLM Group: Generative Error Correction (2024). SLT-Task2-Post-ASR-Speaker-Tagging [Dataset]. https://huggingface.co/datasets/GenSEC-LLM/SLT-Task2-Post-ASR-Speaker-Tagging
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Jun 22, 2024
    Dataset authored and provided by
    ASR-LLM Group: Generative Error Correction
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Name: Dataset for ASR Speaker-Tagging Corrections (Speaker Diarization)

      Description
    

    This dataset consists of pairs of erroneous ASR output and speaker tagging, generated by an ASR system and a speaker diarization system. Each erroneous source transcription is paired with a human-annotated transcription that has the correct transcription and speaker tagging. SEGment-wise Long-form Speech Transcription annotation (SegLST), the file format used in the CHiME challenges… See the full description on the dataset page: https://huggingface.co/datasets/GenSEC-LLM/SLT-Task2-Post-ASR-Speaker-Tagging.

  6. think-tag-fixed-dataset-v2

    • huggingface.co
    + more versions
    Cite
    Shaaf Salman, think-tag-fixed-dataset-v2 [Dataset]. https://huggingface.co/datasets/shaafsalman/think-tag-fixed-dataset-v2
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Authors
    Shaaf Salman
    Description

    shaafsalman/think-tag-fixed-dataset-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. anime-tagging-dataset

    • huggingface.co
    Cite
    Brandon Wu, anime-tagging-dataset [Dataset]. https://huggingface.co/datasets/bwu2018/anime-tagging-dataset
    Explore at:
    Authors
    Brandon Wu
    Description

    bwu2018/anime-tagging-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. Tags-Generation-dataset

    • huggingface.co
    Updated Sep 12, 2024
    Cite
    Zineb Meftah (2024). Tags-Generation-dataset [Dataset]. https://huggingface.co/datasets/zino36/Tags-Generation-dataset
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2024
    Authors
    Zineb Meftah
    Description

    🧠 Tags Generated Dataset

      Dataset Card for Tags Generated Dataset
    

    Curated by: Zineb MEFTAH
    Language(s) (NLP): English

      Dataset Summary
    

    The Keyword Extraction Dataset includes 2,000 samples pairing extracted keywords with their corresponding full-text news articles. It is optimized for tasks such as keyword extraction, text analysis, and model fine-tuning.

    Structure: 2,000 samples of keywords paired with full-text news articles on diverse themes. Column 1: Extracted… See the full description on the dataset page: https://huggingface.co/datasets/zino36/Tags-Generation-dataset.

  9. fineweb-2-compliant-tag

    • huggingface.co
    Updated Aug 2, 2025
    + more versions
    Cite
    Swiss AI Initiative (2025). fineweb-2-compliant-tag [Dataset]. https://huggingface.co/datasets/swiss-ai/fineweb-2-compliant-tag
    Explore at:
    Dataset updated
    Aug 2, 2025
    Dataset authored and provided by
    Swiss AI Initiative
    Description

    swiss-ai/fineweb-2-compliant-tag dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. raw-tags-v1

    • huggingface.co
    Updated Aug 19, 2024
    + more versions
    Cite
    G (2024). raw-tags-v1 [Dataset]. https://huggingface.co/datasets/taixpavel/raw-tags-v1
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Aug 19, 2024
    Authors
    G
    Description

    taixpavel/raw-tags-v1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. libretta-tts-merged-dataset-tags

    • huggingface.co
    Updated May 19, 2024
    + more versions
    Cite
    GA (2024). libretta-tts-merged-dataset-tags [Dataset]. https://huggingface.co/datasets/GrigoriiA/libretta-tts-merged-dataset-tags
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    May 19, 2024
    Authors
    GA
    Description

    GrigoriiA/libretta-tts-merged-dataset-tags dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. libritts-r-text-tags-v3

    • huggingface.co
    Updated Feb 14, 2024
    + more versions
    Cite
    Yoach Lacombe (2024). libritts-r-text-tags-v3 [Dataset]. https://huggingface.co/datasets/ylacombe/libritts-r-text-tags-v3
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Feb 14, 2024
    Authors
    Yoach Lacombe
    Description

    ylacombe/libritts-r-text-tags-v3 dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. wikitext-tags-dataset-v2

    • huggingface.co
    Updated Oct 18, 2025
    + more versions
    Cite
    Hyungji Kim (2025). wikitext-tags-dataset-v2 [Dataset]. https://huggingface.co/datasets/hyungjikim/wikitext-tags-dataset-v2
    Explore at:
    Dataset updated
    Oct 18, 2025
    Authors
    Hyungji Kim
    Description

    hyungjikim/wikitext-tags-dataset-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. expresso-tags

    • huggingface.co
    Updated Nov 14, 2024
    Cite
    Tristan Hooper (2024). expresso-tags [Dataset]. https://huggingface.co/datasets/Qurtana/expresso-tags
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Nov 14, 2024
    Authors
    Tristan Hooper
    Description

    Qurtana/expresso-tags dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. adult-image-tagging

    • huggingface.co
    Updated Jun 21, 2025
    Cite
    Nikola Katsarov (2025). adult-image-tagging [Dataset]. https://huggingface.co/datasets/escapeboy/adult-image-tagging
    Explore at:
    Dataset updated
    Jun 21, 2025
    Authors
    Nikola Katsarov
    Description

    escapeboy/adult-image-tagging dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. emns-tagged-text-v2

    • huggingface.co
    + more versions
    Cite
    Yoach Lacombe, emns-tagged-text-v2 [Dataset]. https://huggingface.co/datasets/ylacombe/emns-tagged-text-v2
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Authors
    Yoach Lacombe
    Description

    ylacombe/emns-tagged-text-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. NER-special-tagging

    • huggingface.co
    Updated Feb 12, 2025
    + more versions
    Cite
    Junhyeong Lee (2025). NER-special-tagging [Dataset]. https://huggingface.co/datasets/Junhyeong86/NER-special-tagging
    Explore at:
    Dataset updated
    Feb 12, 2025
    Authors
    Junhyeong Lee
    Description

    Junhyeong86/NER-special-tagging dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. jas-v2-tags

    • huggingface.co
    Updated Aug 28, 2024
    Cite
    jun (2024). jas-v2-tags [Dataset]. https://huggingface.co/datasets/junjuice0/jas-v2-tags
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Aug 28, 2024
    Authors
    jun
    Description

    junjuice0/jas-v2-tags dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. ethical_world_affecting_cot-tags

    • huggingface.co
    + more versions
    Cite
    constanza fierro, ethical_world_affecting_cot-tags [Dataset]. https://huggingface.co/datasets/cfierro/ethical_world_affecting_cot-tags
    Explore at:
    Authors
    constanza fierro
    Area covered
    World
    Description

    cfierro/ethical_world_affecting_cot-tags dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. wikitext-tags-dataset-v3-unpatched

    • huggingface.co
    Updated Oct 10, 2025
    Cite
    Hyungji Kim (2025). wikitext-tags-dataset-v3-unpatched [Dataset]. https://huggingface.co/datasets/hyungjikim/wikitext-tags-dataset-v3-unpatched
    Explore at:
    Dataset updated
    Oct 10, 2025
    Authors
    Hyungji Kim
    Description

    hyungjikim/wikitext-tags-dataset-v3-unpatched dataset hosted on Hugging Face and contributed by the HF Datasets community

Cite
V (2024). Video-Sequence-Labeling [Dataset]. https://huggingface.co/datasets/2nzi/Video-Sequence-Labeling

Video-Sequence-Labeling

2nzi/Video-Sequence-Labeling

Explore at:
17 scholarly articles cite this dataset (View in Google Scholar)
Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
Dataset updated
Oct 6, 2024
Authors
V
Description

2nzi/Video-Sequence-Labeling dataset hosted on Hugging Face and contributed by the HF Datasets community
