18 datasets found

wagon-images
kaggle.com
Updated May 17, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
neelaydoshi7 (2023). wagon-images [Dataset]. https://www.kaggle.com/datasets/neelaydoshi7/wagon-images
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 17, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
neelaydoshi7
Description
Dataset

This dataset was created by neelaydoshi7

Contents
test_wagon_images
kaggle.com
Updated May 18, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
sodagar rajiv (2023). test_wagon_images [Dataset]. https://www.kaggle.com/datasets/sodagarrajiv/test-wagon-images
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 18, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
sodagar rajiv
Description
Dataset

This dataset was created by sodagar rajiv

Contents
Wagon number dataset
kaggle.com
Updated May 17, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Khabibullokhon Bakhshilloev (2021). Wagon number dataset [Dataset]. https://www.kaggle.com/khabibullokhon/wagon-number-dataset/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 17, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Khabibullokhon Bakhshilloev
Description
Dataset

This dataset was created by Khabibullokhon Bakhshilloev

Released under Data files © Original Authors

Contents
Data_Wagon_Sosamba
kaggle.com
Updated Nov 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Master Sniffer (2023). Data_Wagon_Sosamba [Dataset]. https://www.kaggle.com/datasets/mastersniffer/data-wagon-sosamba/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 11, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Master Sniffer
Description
Dataset

This dataset was created by Master Sniffer

Contents
A
‘Shopping Cart Database’ analyzed by Analyst-2
analyst-2.ai
Updated Jan 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘Shopping Cart Database’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-shopping-cart-database-4459/abe88bba/?iid=010-225&v=presentation
Explore at:
Dataset updated
Jan 28, 2022
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Shopping Cart Database’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/ruchi798/shopping-cart-database on 28 January 2022.

--- Dataset description provided by original source is as follows ---

Context

This dataset contains synthetic data generated by me for one of my courses at Carnegie Mellon University.

Inspiration

Several deductions and analyses can be drawn from this data, including: - Which products were sold the most in the last month? - How have sales and revenue changed over the past few quarters? - Understanding Customer demographics and their preferences

--- Original source retains full ownership of the source dataset ---
GigaFlexhicle Dataset
kaggle.com
Updated Jun 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
QwertyNice (2022). GigaFlexhicle Dataset [Dataset]. https://www.kaggle.com/datasets/dmitrygaus/gigaflexhicle
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 28, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
QwertyNice
Description
The GigaFlexhicle dataset contains many cut-out photos of brands and models of vehicles (cars/trucks/buses), divided by generations. All models of vehicles produced in the same generation belong to the same class, regardless of the body, restyling or special edition. For example, Opel Astra J hatchback, station wagon, GTC are one class, regardless of the change in appearance during restyling. Opel Astra J and Opel Astra H are different classes.

Dataset structure: The images directory contains all vehicle brands. Inside folders with brands are folders with models of this brand. Inside each folder with the model there are possible generations of this model (usually they are listed as 1st, 2nd, etc., however, there may also be names of generations. Further deepening into the directory is considered as one generation). The annotation.csv file contains the class numbers and the relative path to each image. The class was assigned only if it contains 3 or more pictures. The number of classes is 9548. Directories with 2 or less pictures were left for further expansion and search for similar pictures.

For example, I need to take a look at all the photos of the BMW 2-series of the 2nd generation. This folder will contain such 2nd generation brands as 2-series, 2-series Active Tourer and 2-series Gran Tourer.

This dataset is not the final option and its structure will be improved, and it will be supplemented by itself. The dataset may include motorcycles, but their presence is not assumed.

The data used in this dataset has been copied from the website PlatesMania.com
Random Shopping Carts
kaggle.com
zip
Updated Apr 17, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DataWizard20 (2019). Random Shopping Carts [Dataset]. https://www.kaggle.com/datawizard20/random-shopping-carts
Explore at:
zip(46878 bytes)Available download formats
Dataset updated
Apr 17, 2019
Authors
DataWizard20
Description
Dataset

This dataset was created by DataWizard20

Contents

It contains the following files:
OTTO Recommender Systems Dataset
kaggle.com
Updated Feb 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Otto (GmbH & Co KG) (2023). OTTO Recommender Systems Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/4991874
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/4991874
Dataset updated
Feb 14, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Otto (GmbH & Co KG)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The OTTO session dataset is a large-scale dataset intended for multi-objective recommendation research. We collected the data from anonymized behavior logs of the OTTO webshop and the app. The mission of this dataset is to serve as a benchmark for session-based recommendations and foster research in the multi-objective and session-based recommender systems area. We also launched a Kaggle competition with the goal to predict clicks, cart additions, and orders based on previous events in a user session.

For additional background, please see the published OTTO Recommender Systems Dataset GitHub.

Key Features

12M real-world anonymized user sessions

220M events, consiting of clicks, carts and orders

1.8M unique articles in the catalogue

Ready to use data in .jsonl format

Evaluation metrics for multi-objective optimization

Dataset Statistics

Dataset #sessions #items #events #clicks #carts #orders Density [%]
Train 12.899.779 1.855.603 216.716.096 194.720.954 16.896.191 5.098.951 0.0005
Test 1.671.803 1.019.357 13.851.293 12.340.303 1.155.698 355.292 0.0005

Train/Test Split

Since we want to evaluate a model's performance in the future, as would be the case when we deploy such a system in an actual webshop, we choose a time-based validation split. Our train set consists of observations from 4 weeks, while the test set contains user sessions from the following week. Furthermore, we trimmed train sessions overlapping with the test period, as depicted in the following diagram, to prevent information leakage from the future:

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F4621388%2F94cead9aec2ef687490b1212e40f409a%2Ftrain_test_split.png?generation=1676645044801713&alt=media" alt="Train/Test Split">
Shopping cart
kaggle.com
zip
Updated May 21, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Faisal Jamil (2020). Shopping cart [Dataset]. https://www.kaggle.com/faisaljamil/shopping-cart
Explore at:
zip(209860 bytes)Available download formats
Dataset updated
May 21, 2020
Authors
Faisal Jamil
Description
Dataset

This dataset was created by Faisal Jamil

Released under Other (specified in description)

Contents

It contains the following files:
Donner Party
kaggle.com
Updated Jul 16, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kevon Nelson (2019). Donner Party [Dataset]. https://www.kaggle.com/morales2018/donner-party/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 16, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Kevon Nelson
Description
Context

The Donner Party were a group of emigrants moving to start a new life in California. But between 1846 and 1847, 45 out of the 87 people on the wagon train would die from sickness, starvation, murder, and cannibalism.

Content

The dataset has information gathered from reading Daniel James Brown, The Indifferent Stars Above, and overall reading online

Acknowledgements

James Brown, Daniel. The Indifferent Stars Above. New York: HarperCollins Publishers, 2015.

Inspiration

A question that I had while researching the Donner Party was what factors influenced their chances of survival(Age, Sex, etc). So this was made in an attempt to answer that question
yesssss
kaggle.com
Updated Jan 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cart Vista (2025). yesssss [Dataset]. https://www.kaggle.com/datasets/cartvista/yesssss/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 6, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Cart Vista
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by Cart Vista

Released under CC0: Public Domain

Contents
Stated Preferences for Car Choice
kaggle.com
data.mendeley.com
Updated May 3, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Steven Taylor (2019). Stated Preferences for Car Choice [Dataset]. https://www.kaggle.com/steventaylor11/stated-preferences-for-car-choice/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 3, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Steven Taylor
Description
Context

This Data set including all the Variables (choice, college,hsg2, coml5, typez, fuelz, pricez, speedz, pollutionz, sizez) from 2016 to 2018.I scrapped this data from www.qed.econ.queensu.ca

Content

Attribute Information:

choice-choice of a vehicle among 6 propositions

college-college education

hsg2-size of household greater than 2

coml5-commute lower than 5 miles a day

typez-body type, one of regcar (regular car), sportuv (sport utility vehicle), sportcar, stwagon (station wagon), truck, van, for each proposition z from 1 to 6

fuelz-fuel for proposition z, one of gasoline, methanol, cng (compressed natural gas), electric.

pricez-price of vehicle divided by the logarithm of income

speedz-highest attainable speed in hundreds of mph

pollutionz-tailpipe emissions as fraction of those for new gas vehicle

sizez- 0 for a mini, 1 for a subcompact, 2 for a compact and 3 for a mid–size or large vehicle

Source

McFadden, Daniel and Kenneth Train (2000) “Mixed MNL models for discrete response”, Journal of Applied Econometrics, 15(5), 447–470.

References

Journal of Applied Econometrics data archive : http://qed.econ.queensu.ca/jae/.
Cart Classification Technique Competition
kaggle.com
Updated Aug 2, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Satya Mishra (2018). Cart Classification Technique Competition [Dataset]. https://www.kaggle.com/thebratattack/cart-classification-technique-competition/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 2, 2018
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Satya Mishra
Description
Dataset

This dataset was created by Satya Mishra

Contents
2024 Car Power Weight Ratios and Values
kaggle.com
zip
Updated Apr 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cliff Chiang (2024). 2024 Car Power Weight Ratios and Values [Dataset]. https://www.kaggle.com/datasets/cliffchiang/2024-car-power-weight-ratios-and-their-values
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Apr 19, 2024
Authors
Cliff Chiang
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset containing vehicles sold in the US market of 2024-2025 year. Compares horsepower, torque, weight, and ratios of all makes and models sold in the US market of 2024-2025.

Data is taken from manufacturer website and Car & Driver where applicable.

I only compared data with vehicles designed, marketed, and sold as sedans or lower. Wagons were included where applicable. The Mercedes E-class wagon was excluded due to lack of data found. Data excludes vehicles sold and marketed as CUV and above (CUVs, SUVs, Trucks, Vans, etc.)
Traffic Violations Dataset
kaggle.com
Updated Nov 29, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NIKHIL (2023). Traffic Violations Dataset [Dataset]. https://www.kaggle.com/datasets/nikhil1e9/traffic-violations
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 29, 2023
Dataset provided by
Kaggle
Authors
NIKHIL
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Description

This dataset contains traffic violation information from electronic traffic violations issued in the US. Any information that can be used to uniquely identify the vehicle, the vehicle owner or the officer issuing the violation is not included. Some features were removed from the original dataset and all remaining character features were recoded as nominal factor variables. All punctuation characters were removed from factor levels. The variable 'Violation.Type' is used as target by default. The smaller target categories 'SERO' and 'ESERO' were collapsed into one category labeled 'SERO'. Unused factor levels and a few almost constant features were dropped.

Features

Description: Text description of the specific charge

Belts: If seat belts were in use in accident cases or not?

Personal Injury: If traffic violation involved Personal Injury or not?

Property Damage: If traffic violation involved Property Damage or not?

Commercial License: If the driver holds a Commercial Drivers License or not?

Commercial Vehicle: If the vehicle committing the traffic violation is a commercial vehicle or not?

State: State issuing the vehicle registration

VehicleType: Type of vehicle (Examples: Automobile, Station Wagon, Heavy Duty Truck, etc.)

Year: Year the vehicle was made

Make: Manufacturer of the vehicle (Examples: Ford, Chevy, Honda, Toyota, etc.)

Model: Model of the vehicle

Color: Color of the vehicle

Charge: Alphanumeric code for the specific charge

Contributed To Accident: If the traffic violation was a contributing factor in an accident or not?

Race: Race of the driver (Example: Asian, Black, White, Other, etc.)

Gender: Gender of the driver (F = Female, M = Male)

Driver City: City of the driver’s home address

Driver State: State of the driver’s home address

DL State: State issuing the Driver’s License

Arrest Type: Type of Arrest (A = Marked, B = Unmarked, etc.)

Violation Type: Type of Violation (Examples: Warning, Citation, SERO)

Please, provide an upvote👍if the dataset was useful for your task. It would be much appreciated😄
Data from: Gender Recognition by Voice
kaggle.com
Updated Aug 26, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kory Becker (2016). Gender Recognition by Voice [Dataset]. https://www.kaggle.com/datasets/primaryobjects/voicegender/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 26, 2016
Dataset provided by
Kaggle
Authors
Kory Becker
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Voice Gender

Gender Recognition by Voice and Speech Analysis

This database was created to identify a voice as male or female, based upon acoustic properties of the voice and speech. The dataset consists of 3,168 recorded voice samples, collected from male and female speakers. The voice samples are pre-processed by acoustic analysis in R using the seewave and tuneR packages, with an analyzed frequency range of 0hz-280hz (human vocal range).

The Dataset

The following acoustic properties of each voice are measured and included within the CSV:

meanfreq: mean frequency (in kHz)

sd: standard deviation of frequency

median: median frequency (in kHz)

Q25: first quantile (in kHz)

Q75: third quantile (in kHz)

IQR: interquantile range (in kHz)

skew: skewness (see note in specprop description)

kurt: kurtosis (see note in specprop description)

sp.ent: spectral entropy

sfm: spectral flatness

mode: mode frequency

centroid: frequency centroid (see specprop)

peakf: peak frequency (frequency with highest energy)

meanfun: average of fundamental frequency measured across acoustic signal

minfun: minimum fundamental frequency measured across acoustic signal

maxfun: maximum fundamental frequency measured across acoustic signal

meandom: average of dominant frequency measured across acoustic signal

mindom: minimum of dominant frequency measured across acoustic signal

maxdom: maximum of dominant frequency measured across acoustic signal

dfrange: range of dominant frequency measured across acoustic signal

modindx: modulation index. Calculated as the accumulated absolute difference between adjacent measurements of fundamental frequencies divided by the frequency range

label: male or female

Accuracy

Baseline (always predict male)

50% / 50%

Logistic Regression

97% / 98%

CART

96% / 97%

Random Forest

100% / 98%

SVM

100% / 99%

XGBoost

100% / 99%

Research Questions

An original analysis of the data-set can be found in the following article:

Identifying the Gender of a Voice using Machine Learning

The best model achieves 99% accuracy on the test set. According to a CART model, it appears that looking at the mean fundamental frequency might be enough to accurately classify a voice. However, some male voices use a higher frequency, even though their resonance differs from female voices, and may be incorrectly classified as female. To the human ear, there is apparently more than simple frequency, that determines a voice's gender.

Questions

What other features differ between male and female voices?

Can we find a difference in resonance between male and female voices?

Can we identify falsetto from regular voices? (separate data-set likely needed for this)

Are there other interesting features in the data?

CART Diagram

http://i.imgur.com/Npr2U7O.png" alt="CART model">

Mean fundamental frequency appears to be an indicator of voice gender, with a threshold of 140hz separating male from female classifications.

References

The Harvard-Haskins Database of Regularly-Timed Speech

Telecommunications & Signal Processing Laboratory (TSP) Speech Database at McGill University, Home

VoxForge Speech Corpus, Home

Festvox CMU_ARCTIC Speech Database at Carnegie Mellon University
Auto Scout Car Price
kaggle.com
Updated Dec 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yaşar Yiğit Turan (2024). Auto Scout Car Price [Dataset]. https://www.kaggle.com/datasets/yaaryiitturan/auto-scout-car-price/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 2, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Yaşar Yiğit Turan
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
This dataset provides comprehensive details on used car listings, including vehicle specifications, features, pricing, and more. It's valuable for analyzing car prices, trends, and customer preferences in the automotive market.

Columns Description

make_model: The brand and model of the vehicle (e.g., 'Audi A1').

body_type: The body style of the vehicle, such as Sedan, Compact, or Station Wagon.

price: The listed price of the car in currency.

vat: Indicates the VAT status for the vehicle's price (e.g., VAT deductible, Price negotiable).

km: The total mileage (in kilometers) of the vehicle, indicating its usage.

Type: Condition of the vehicle, whether it's 'Used' or 'New'.

Fuel: Type of fuel the vehicle uses, such as 'Diesel', 'Benzine', etc.

Gears: The number of gears in the vehicle's transmission.

Comfort_Convenience: Comfort and convenience features, such as 'Air conditioning', 'Leather steering wheel', 'Cruise control', and more.

Entertainment_Media: Media features available in the vehicle, including 'Bluetooth', 'MP3', 'Radio', etc.

Extras: Additional features like 'Alloy wheels', 'Sport suspension', etc.

Safety_Security: Safety features like 'ABS', 'Airbags', 'Electronic stability control', 'Isofix', etc.

age: Age of the car (calculated based on the model year).

Previous_Owners: The number of previous owners the car has had.

hp_kW: Engine power in kilowatts (kW), indicating the performance capacity of the engine.

Inspection_new: Indicates whether the car has recently undergone an inspection (1 for yes, 0 for no).

Paint_Type: The type of paint on the car, such as 'Metallic', 'Matte', etc.

Upholstery_type: The material used for the interior upholstery, such as 'Cloth', 'Leather', etc.

Gearing_Type: The type of transmission the car uses, either 'Automatic' or 'Manual'.

Displacement_cc: The engine displacement in cubic centimeters (cc), indicating the size of the engine.

Weight_kg: The total weight of the vehicle in kilograms.

Drive_chain: The type of drivetrain, indicating whether it's 'Front' or 'Rear' wheel drive.

cons_comb: The combined fuel consumption in liters per 100 kilometers.

Key Features

Vehicle Specifications: Covers details like make, model, body type, fuel type, and more.

Comfort & Safety Features: Includes information on air conditioning, safety features, and other convenience options.

Performance Metrics: Provides data on mileage, engine power, weight, and fuel consumption.

Pricing Information: Insights into vehicle pricing, VAT status, and other cost-related details.

Ideal Use Cases

Price Prediction: Model car prices based on features like mileage, fuel type, and performance.

Market Analysis: Explore trends and preferences in the used car market, by type, region, or other metrics.

Customer Segmentation: Segment the dataset to analyze different customer preferences for car types, features, or price ranges.

Feature Importance: Identify the most important factors influencing car prices (e.g., fuel type, mileage, age).

This dataset is ideal for machine learning, data analysis, and business intelligence applications in the automotive industry.
1985 Automobile Dataset
kaggle.com
zip
Updated Feb 27, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fazil T (2019). 1985 Automobile Dataset [Dataset]. https://www.kaggle.com/fazilbtopal/auto85
Explore at:
zip(4784 bytes)Available download formats
Dataset updated
Feb 27, 2019
Authors
Fazil T
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
Context

Dataset is about cars from back in 85. It's raw and messy.

Content

This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics, (b) its assigned insurance risk rating, (c) its normalized losses in use as compared to other cars. The second rating corresponds to the degree to which the auto is more risky than its price indicates. Cars are initially assigned a risk factor symbol associated with its price. Then, if it is more risky (or less), this symbol is adjusted by moving it up (or down) the scale. Actuarians call this process "symboling". A value of +3 indicates that the auto is risky, -3 that it is probably pretty safe. The third factor is the relative average loss payment per insured vehicle year. This value is normalized for all autos within a particular size classification (two-door small, station wagons, sports/specialty, etc...), and represents the average loss per car per year.

Note: Several of the attributes in the database could be used as a "class" attribute.

Number of Instances: 205

Number of Attributes: 26 total
-- 15 continuous
-- 1 integer
-- 10 nominal

Attribute Information:

| Attribute | Attribute Range

symboling: -3, -2, -1, 0, 1, 2, 3

normalized-losses: continuous from 65 to 256

make: alfa-romero, audi, bmw, chevrolet, dodge, honda, isuzu, jaguar, mazda, mercedes-benz, mercury, mitsubishi, nissan, peugot, plymouth, porsche, renault, saab, subaru, toyota, volkswagen, volvo

fuel-type: diesel, gas

aspiration: std, turbo

num-of-doors: four, two

body-style: hardtop, wagon, sedan, hatchback, convertible

drive-wheels: 4wd, fwd, rwd

engine-location: front, rear

wheel-base: continuous from 86.6 120.9

length: continuous from 141.1 to 208.1

width: continuous from 60.3 to 72.3

height: continuous from 47.8 to 59.8

curb-weight: continuous from 1488 to 4066

engine-type: dohc, dohcv, l, ohc, ohcf, ohcv, rotor

num-of-cylinders: eight, five, four, six, three, twelve, two

engine-size: continuous from 61 to 326

fuel-system: 1bbl, 2bbl, 4bbl, idi, mfi, mpfi, spdi, spfi

bore: continuous from 2.54 to 3.94

stroke: continuous from 2.07 to 4.17

compression-ratio: continuous from 7 to 23

horsepower: continuous from 48 to 288

peak-rpm: continuous from 4150 to 6600

city-mpg: continuous from 13 to 49

highway-mpg: continuous from 16 to 54

price: continuous from 5118 to 45400.

Acknowledgements

"Automobile Data Set" from the following link: https://archive.ics.uci.edu/ml/machine-learning-databases/autos/imports-85.data.

Inspiration

I used these data for data cleaning/analysis purposes.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Dataset	#sessions	#items	#events	#clicks	#carts	#orders	Density [%]
Train	12.899.779	1.855.603	216.716.096	194.720.954	16.896.191	5.098.951	0.0005
Test	1.671.803	1.019.357	13.851.293	12.340.303	1.155.698	355.292	0.0005

Facebook

Twitter

Click to copy link

Link copied

Cite

neelaydoshi7 (2023). wagon-images [Dataset]. https://www.kaggle.com/datasets/neelaydoshi7/wagon-images

wagon-images

Explore at:

68 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

May 17, 2023

Dataset provided by

Kagglehttp://kaggle.com/

Authors

neelaydoshi7

Description

Dataset

This dataset was created by neelaydoshi7

Clear search

Close search

Google apps

Main menu

wagon-images

Dataset

Contents

test_wagon_images

Dataset

Contents

Wagon number dataset

Dataset

Contents

Data_Wagon_Sosamba

Dataset

Contents

‘Shopping Cart Database’ analyzed by Analyst-2

Context

Inspiration

GigaFlexhicle Dataset

Random Shopping Carts

Dataset

Contents

OTTO Recommender Systems Dataset

Key Features

Dataset Statistics

Train/Test Split

Shopping cart

Dataset

Contents

Donner Party

Context

Content

Acknowledgements

Inspiration

yesssss

Dataset

Contents

Stated Preferences for Car Choice

Context

Content

Source

References

Cart Classification Technique Competition

Dataset

Contents

2024 Car Power Weight Ratios and Values

Traffic Violations Dataset

Description

Features

Data from: Gender Recognition by Voice

Voice Gender

The Dataset

Accuracy

Baseline (always predict male)

Logistic Regression

CART

Random Forest

SVM

XGBoost

Research Questions

Questions

CART Diagram

References

Auto Scout Car Price

Columns Description

Key Features

Ideal Use Cases

1985 Automobile Dataset

Context

Content

Attribute Information:

Acknowledgements

Inspiration

wagon-images

Dataset

Contents