2 datasets found
  1. CAR DATASET 2025

    • kaggle.com
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AYUSH KUMAR SINGH (2025). CAR DATASET 2025 [Dataset]. https://www.kaggle.com/datasets/mrjiiiayush/car-dataset-2025
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 7, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    AYUSH KUMAR SINGH
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Overview. This dataset holds rows of individual car models with both textual descriptors and numeric specifications. Typical columns include product identifiers (car name), brand/manufacturer, performance numbers (0–100 km/h), top speed, engine or battery capacity (CC or kWh), horsepower, fuel type, transmission, and the selling price. Some columns include units or symbols (e.g., “$”, “cc”, “sec”), so preprocessing is necessary to convert them into numeric features.

    Intended uses. Great for supervised regression (price prediction), classification after binning price ranges (budget vs premium), clustering/manufacturing segmentation, and teaching data cleaning (regex extraction), encoding categorical variables, imputation, and model evaluation. Also suitable for feature-importance and explainability demos (e.g., which specs most affect price).

    Typical problems to solve.

    Predict price from car specs (regression).

    Classify cars into price tiers (economy, mid-range, premium).

    Identify feature importances (what drives price — horsepower, brand, battery size?).

    Clean and parse messy textual numeric fields (hands-on regex practice).

  2. T

    Monthly evapotranspiration dataset with 30m spatial resolution over oasis in...

    • tpdc.ac.cn
    • casearthpoles.tpdc.ac.cn
    zip
    Updated Jan 16, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bingfang WU (2016). Monthly evapotranspiration dataset with 30m spatial resolution over oasis in the middle reaches and 1 km spatial resolution over the Heihe River Basin (2014) [Dataset]. http://doi.org/10.3972/heihe.0040.2016.db
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 16, 2016
    Dataset provided by
    TPDC
    Authors
    Bingfang WU
    Area covered
    黑河市,
    Description

    Using ETWatch model with the system complete the heihe river basin scale 1 km resolution 2014 surface evaporation data with middle oasis 30 meters resolution on scale data set, the surface evaporation raster image data of the data sets, it is the time resolution of scale from month to month, the spatial resolution of 1 km scale (covering the whole basin) and 30 meters scale (middle oasis area), the unit is mm.Data types include monthly, quarterly, and annual data. The projection information of the data is as follows: Albers equal-area cone projection, Central longitude: 110 degrees, First secant: 25 degrees, Second secant: 47 degrees, Coordinates by west: 4000000 meter.

    File naming rules are as follows: 1) 1 km resolution remote sensing data set Monthly cumulative ET value file name: heihe-1km_2014m01_eta.tif Heihe refers to heihe river basin, 1km means the resolution is 1km, 2014 means the year of 2014, m01 means the month of January, eta means the actual evapotranspiration data, and tif means the data is tif format. Name of quarterly cumulative ET value file: heihe-1km_2014s01_eta.tif Heihe represents the heihe river basin, 1km represents the resolution of 1km, 2014 represents the year of 2014, s01 represents the period from January to march, and the first quarter, eta represents the actual evapotranspiration data, and tif represents the data in tif format. Annual cumulative value file name: heihe-1km_2014y_eta.tif Heihe represents the heihe river basin, 1km represents the resolution of 1km, 2014 represents the year of 2014, y represents the year, eta represents the actual evapotranspiration data, and tif represents the data in tif format. 2) remote sensing data set with a resolution of 30 meters Monthly cumulative ET value file name: heihe-midoasa-30m_2014m01_eta.tif Heihe represents the heihe river basin, midoasis represents the mid-range oasis area, 30m represents the resolution of 30 meters, 2014 represents 2014, m01 represents January, eta represents the actual evapotranspiration data, and tif represents the data in tif format. Name of quarterly cumulative ET value file: heihe-midoasa-30m_2014s01_eta.tif Heihe represents the heihe river basin, midoasis represents the mid-range oasis area, 30m represents the resolution of 30 meters, 2014 represents 2014, s01 represents january-march, and the first quarter, eta represents the actual evapotranspiration data, and tif represents the data in tif format. Annual cumulative value file name: heihe-midoasa-30m_2014y_eta.tif Heihe represents the heihe river basin, midoasis represents the mid-range oasis area, 30m represents the resolution of 30 meters, 2014 represents the year of 2014, y represents the year, eta represents the actual evapotranspiration data, and tif represents the data in tif format.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
AYUSH KUMAR SINGH (2025). CAR DATASET 2025 [Dataset]. https://www.kaggle.com/datasets/mrjiiiayush/car-dataset-2025
Organization logo

CAR DATASET 2025

Cars Datasets 2025 — a tabular dataset of automobiles (names, manufacturer, tech

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 7, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
AYUSH KUMAR SINGH
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Overview. This dataset holds rows of individual car models with both textual descriptors and numeric specifications. Typical columns include product identifiers (car name), brand/manufacturer, performance numbers (0–100 km/h), top speed, engine or battery capacity (CC or kWh), horsepower, fuel type, transmission, and the selling price. Some columns include units or symbols (e.g., “$”, “cc”, “sec”), so preprocessing is necessary to convert them into numeric features.

Intended uses. Great for supervised regression (price prediction), classification after binning price ranges (budget vs premium), clustering/manufacturing segmentation, and teaching data cleaning (regex extraction), encoding categorical variables, imputation, and model evaluation. Also suitable for feature-importance and explainability demos (e.g., which specs most affect price).

Typical problems to solve.

Predict price from car specs (regression).

Classify cars into price tiers (economy, mid-range, premium).

Identify feature importances (what drives price — horsepower, brand, battery size?).

Clean and parse messy textual numeric fields (hands-on regex practice).

Search
Clear search
Close search
Google apps
Main menu