100+ datasets found
  1. Normalized Dataset

    • kaggle.com
    zip
    Updated Jun 15, 2022
    Cite
    Hemanth S (2022). Normalized Dataset [Dataset]. https://www.kaggle.com/datasets/hemanth012/normalized-dataset
    Available download formats: zip (1009250933 bytes)
    Dataset updated
    Jun 15, 2022
    Authors
    Hemanth S
    Description

    Dataset

    This dataset was created by Hemanth S

    Contents

  2. Songs Normalize Dataset

    • kaggle.com
    zip
    Updated Apr 2, 2025
    Cite
    Mohammed Ashraf Shaaban Shahata (2025). Songs Normalize Dataset [Dataset]. https://www.kaggle.com/datasets/mohammedashraf2004/songs-normalize-dataset
    Available download formats: zip (95910 bytes)
    Dataset updated
    Apr 2, 2025
    Authors
    Mohammed Ashraf Shaaban Shahata
    Description

    Dataset

    This dataset was created by Mohammed Ashraf Shaaban Shahata

    Released under Other (specified in description)

    Contents

  3. Normalization Template

    • kaggle.com
    zip
    Updated Nov 2, 2023
    + more versions
    Cite
    Kiro Youssef (2023). Normalization Template [Dataset]. https://www.kaggle.com/datasets/kiroyoussef/normalization-template
    Available download formats: zip (22 bytes)
    Dataset updated
    Nov 2, 2023
    Authors
    Kiro Youssef
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Kiro Youssef

    Released under Apache 2.0

    Contents

  4. Data from: Scaling and Normalization

    • kaggle.com
    Updated Feb 2, 2024
    Cite
    Engr Yasir Hussain (2024). Scaling and Normalization [Dataset]. https://www.kaggle.com/datasets/mryasirturi/scaling-and-normalization
    Available formats: Croissant, a format for machine-learning datasets (see mlcommons.org/croissant)
    Dataset updated
    Feb 2, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Engr Yasir Hussain
    Description

    Dataset

    This dataset was created by Engr Yasir Hussain

    Contents

  5. new 512x512x64 no normalize no augment

    • kaggle.com
    zip
    Updated Mar 11, 2021
    Cite
    Mark A Lavin (2021). new 512x512x64 no normalize no augment [Dataset]. https://www.kaggle.com/markalavin/new-512x512x64-no-normalize-no-augment
    Available download formats: zip (37680122587 bytes)
    Dataset updated
    Mar 11, 2021
    Authors
    Mark A Lavin
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Mark A Lavin

    Released under CC0: Public Domain

    Contents

  6. OP-l2-normalization dataset

    • kaggle.com
    zip
    Updated Jul 11, 2024
    Cite
    amjad ali2018 (2024). OP-l2-normalization dataset [Dataset]. https://www.kaggle.com/datasets/amjadali2018/op-l2-normalization-dataset
    Available download formats: zip (417646704 bytes)
    Dataset updated
    Jul 11, 2024
    Authors
    amjad ali2018
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset

    This dataset was created by amjad ali2018

    Released under Apache 2.0

    Contents

  7. LJSpeech Raw Normalize

    • kaggle.com
    zip
    Updated Nov 8, 2025
    Cite
    Nguyễn Thanh (2025). LJSpeech Raw Normalize [Dataset]. https://www.kaggle.com/datasets/lookingformyself/ljspeech-raw-normalize
    Available download formats: zip (3330874463 bytes)
    Dataset updated
    Nov 8, 2025
    Authors
    Nguyễn Thanh
    Description

    Dataset

    This dataset was created by Nguyễn Thanh

    Contents

  8. Tachygraphy Microtext Normalization

    • kaggle.com
    zip
    Updated Mar 3, 2025
    Cite
    Archisman Karmakar (2025). Tachygraphy Microtext Normalization [Dataset]. https://www.kaggle.com/datasets/archismancoder/dataset-tachygraphy
    Available download formats: zip (4565198 bytes)
    Dataset updated
    Mar 3, 2025
    Authors
    Archisman Karmakar
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Archisman Karmakar

    Released under MIT

    Contents

  9. COVID-19 Daily Data

    • kaggle.com
    zip
    Updated May 10, 2020
    Cite
    osa4olli (2020). COVID-19 Daily Data [Dataset]. https://www.kaggle.com/osa4olli/covid19-data-normalized-from-csse
    Available download formats: zip (5872546 bytes)
    Dataset updated
    May 10, 2020
    Authors
    osa4olli
    License

    Open Database License (ODbL) v1.0 (https://www.opendatacommons.org/licenses/odbl/1.0/)
    License information was derived automatically

    Description

    Simple normalization of the data provided by the CSSE daily reports on GitHub. Preparations I made:

    • Normalizing the timestamp (since they provide four different formats)

    • Pruning the column labels (Region/Country => Region_Country, etc.)

    • Adding a country code column
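
The three preparation steps described above could look roughly like the following sketch. It is not the dataset author's code: the four timestamp formats, the label-pruning rule, and the tiny country-code table are all assumptions chosen for illustration, since the description does not list them.

```python
import re
from datetime import datetime

# Hypothetical candidate formats. The description mentions four formats
# but does not say which ones, so these are illustrative guesses.
CANDIDATE_FORMATS = (
    "%Y-%m-%dT%H:%M:%S",
    "%Y-%m-%d %H:%M:%S",
    "%m/%d/%Y %H:%M",
    "%m/%d/%y %H:%M",
)

# Hypothetical lookup table; a real one would cover every country.
COUNTRY_CODES = {"Germany": "DE", "Italy": "IT"}


def normalize_timestamp(raw):
    """Try each candidate format and return an ISO 8601 string."""
    text = raw.strip()
    for fmt in CANDIDATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized timestamp: {raw!r}")


def prune_label(label):
    """Replace runs of non-alphanumeric characters with one underscore."""
    return re.sub(r"[^0-9A-Za-z]+", "_", label).strip("_")


def normalize_row(row):
    """Apply all three steps to one record (a dict of column -> value)."""
    out = {prune_label(key): value for key, value in row.items()}
    out["Last_Update"] = normalize_timestamp(out["Last_Update"])
    out["Country_Code"] = COUNTRY_CODES.get(out["Region_Country"], "")
    return out


example = normalize_row({"Region/Country": "Germany", "Last Update": "3/8/20 5:31"})
```

Trying the formats in order and falling through on `ValueError` keeps the parser easy to extend if a fifth format turns up in later reports.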

    Photo by CDC on Unsplash

  10. Google Text Normalization Challenge

    • kaggle.com
    zip
    Updated Apr 26, 2017
    Cite
    Google Natural Language Understanding Research (2017). Google Text Normalization Challenge [Dataset]. https://www.kaggle.com/datasets/google-nlu/text-normalization/discussion
    Available download formats: zip (1523170770 bytes)
    Dataset updated
    Apr 26, 2017
    Dataset provided by
    Google (http://google.com/)
    Authors
    Google Natural Language Understanding Research
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) (https://creativecommons.org/licenses/by-nc-sa/4.0/)
    License information was derived automatically

    Description

    Challenge Description

    This dataset and accompanying paper present a challenge to the community: given a large corpus of written text aligned to its normalized spoken form, train an RNN to learn the correct normalization function. That is, a date written "31 May 2014" is spoken as "the thirty first of may twenty fourteen." We present a dataset of general text where the normalizations were generated using an existing text normalization component of a text-to-speech (TTS) system. This dataset was originally released open-source here and is reproduced on Kaggle for the community.

    The Data

    The data in this directory are the English language training, development and test data used in Sproat and Jaitly (2016).

    The following divisions of data were used:

    • Training: output_1 through output_21 (corresponding to output-000[0-8]?-of-00100 in the original dataset)

    • Runtime eval: output_91 (corresponding to output-0009[0-4]-of-00100 in the original dataset)

    • Test data: output_96 (corresponding to output-0009[5-9]-of-00100 in the original dataset)
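
To make the split sizes concrete, the sketch below expands the three shard patterns listed above against the 100 original shard names. It assumes the patterns are shell-style globs (so `?` matches any single character) and that shards are named `output-NNNNN-of-00100` with zero-padded indices; neither is stated explicitly in the description.

```python
from fnmatch import fnmatchcase

# Patterns copied from the list above, read as shell globs (an assumption).
PATTERNS = {
    "train": "output-000[0-8]?-of-00100",
    "runtime_eval": "output-0009[0-4]-of-00100",
    "test": "output-0009[5-9]-of-00100",
}

# Assumed naming scheme for the 100 shards of the original release.
shard_names = [f"output-{i:05d}-of-00100" for i in range(100)]

# Group the shard names by the split whose pattern they match.
splits = {
    name: [shard for shard in shard_names if fnmatchcase(shard, pattern)]
    for name, pattern in PATTERNS.items()
}

for name, shards in splits.items():
    print(name, len(shards), shards[0], "...", shards[-1])
```

Under this reading the training pattern covers shards 00000 through 00089, and the two evaluation patterns cover five shards each.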

    In practice for the results reported in the paper only the first 100,002 lines of output-00099-of-00100 were used (for English).

    Lines with "

  11. VCTK Raw Normalize

    • kaggle.com
    zip
    Updated Nov 2, 2025
    Cite
    Nguyễn Thanh (2025). VCTK Raw Normalize [Dataset]. https://www.kaggle.com/datasets/lookingformyself/vctk-raw-normalize
    Available download formats: zip (5439666495 bytes)
    Dataset updated
    Nov 2, 2025
    Authors
    Nguyễn Thanh
    Description

    Dataset

    This dataset was created by Nguyễn Thanh

    Contents

  12. Normalization-data

    • kaggle.com
    zip
    Updated Jul 25, 2022
    Cite
    Jay Oza (2022). Normalization-data [Dataset]. https://www.kaggle.com/datasets/jayoza198/normalizationdata
    Available download formats: zip (4287 bytes)
    Dataset updated
    Jul 25, 2022
    Authors
    Jay Oza
    Description

    Dataset

    This dataset was created by Jay Oza

    Contents

  13. dataset for applying normalization techniques

    • kaggle.com
    zip
    Updated Nov 4, 2020
    Cite
    Akalya Subramanian (2020). dataset for applying normalization techniques [Dataset]. https://www.kaggle.com/akalyasubramanian/dataset-for-applying-normalization-techniques
    Available download formats: zip (3155326 bytes)
    Dataset updated
    Nov 4, 2020
    Authors
    Akalya Subramanian
    Description

    Dataset

    This dataset was created by Akalya Subramanian

    Contents

  14. LibriTTS Clean 100 Raw Normalize

    • kaggle.com
    zip
    Updated Nov 8, 2025
    Cite
    Nguyễn Thanh (2025). LibriTTS Clean 100 Raw Normalize [Dataset]. https://www.kaggle.com/datasets/lookingformyself/libritts-clean-100-raw-normalize
    Available download formats: zip (7525325886 bytes)
    Dataset updated
    Nov 8, 2025
    Authors
    Nguyễn Thanh
    Description

    Dataset

    This dataset was created by Nguyễn Thanh

    Contents

  15. Entity-Normalization

    • kaggle.com
    zip
    Updated May 24, 2024
    Cite
    mcasshy (2024). Entity-Normalization [Dataset]. https://www.kaggle.com/datasets/mcasshy/entity-normalization/code
    Available download formats: zip (32246846 bytes)
    Dataset updated
    May 24, 2024
    Authors
    mcasshy
    Description

    Dataset

    This dataset was created by mcasshy

    Contents

  16. normalization preprocessing

    • kaggle.com
    zip
    Updated May 16, 2022
    Cite
    Vinay sharma (2022). normalization preprocessing [Dataset]. https://www.kaggle.com/datasets/vinaysharma1212/normalization-preprocessing/code
    Available download formats: zip (9011 bytes)
    Dataset updated
    May 16, 2022
    Authors
    Vinay sharma
    Description

    Dataset

    This dataset was created by Vinay sharma

    Contents

  17. NORMALIZE STATIC ATTENTION 100k step traincontinue

    • kaggle.com
    zip
    Updated Jun 30, 2024
    Cite
    VMHieu02 (2024). NORMALIZE STATIC ATTENTION 100k step traincontinue [Dataset]. https://www.kaggle.com/datasets/vmhieu02/normalize-static-attention-100k-step-traincontinue
    Available download formats: zip (1071022430 bytes)
    Dataset updated
    Jun 30, 2024
    Authors
    VMHieu02
    Description

    Dataset

    This dataset was created by VMHieu02

    Contents

  18. attens coco 1000 255 /255 and no normalize

    • kaggle.com
    zip
    Updated Mar 11, 2025
    Cite
    SeaLeopard (2025). attens coco 1000 255 /255 and no normalize [Dataset]. https://www.kaggle.com/datasets/sealeopard/attens-coco-1000-255-255-and-no-normalize
    Available download formats: zip (1881930937 bytes)
    Dataset updated
    Mar 11, 2025
    Authors
    SeaLeopard
    Description

    Dataset

    This dataset was created by SeaLeopard

    Contents

  19. Bangla-Normalized-Data

    • kaggle.com
    zip
    Updated Jul 4, 2024
    Cite
    Skylark4 (2024). Bangla-Normalized-Data [Dataset]. https://www.kaggle.com/skylark4/bangla-normalized-data
    Available download formats: zip (3439588472 bytes)
    Dataset updated
    Jul 4, 2024
    Authors
    Skylark4
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Skylark4

    Released under CC0: Public Domain

    Contents

  20. label_normalize

    • kaggle.com
    zip
    Updated Nov 17, 2023
    Cite
    Ngo Tri Si (2023). label_normalize [Dataset]. https://www.kaggle.com/datasets/ngotrisi/label-normalize
    Available download formats: zip (1034582281 bytes)
    Dataset updated
    Nov 17, 2023
    Authors
    Ngo Tri Si
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Ngo Tri Si

    Released under Apache 2.0

    Contents
