Alesco Data's Automotive records are updated monthly from millions of proprietary sourced vehicle transactions. These incoming transactions are processed through compilation rules and are either added as new, incremental records to our file, or contribute to validating existing records.
Our recent focus is on compiling new vehicle ownership, and the file includes over 14.2 million late model vehicle owners (2020-2025).
In addition, we append our Persistent ID, telephone numbers, and demographics for a complete file that can support your direct mail and email marketing, lead validation, and identity verification needs. A Persistent ID is assigned to each vehicle record and tracks consumers as they change addresses or phone numbers, and vehicles as they change owners.
The database is not derived from state motor vehicle databases and therefore not subject to the Shelby Act also known as the Driver's Privacy Protection Act (DPPA) of 2000. The data is deterministic and sources include sales and service data, warranty data and notifications, aftermarket repair and maintenance facilities, and scheduled maintenance records.
Fields Included: -Make -Model -Year -VIN -Vehicle Class Code (crossover, SUV, full-size, mid-size, small) -Vehicle Fuel Code (gas, flex, hybrid) -Vehicle Style Code (sport, pickup, utility, sedan) -Mileage -Number of Vehicles per Household -First seen date -Last seen date -Email
In 2023, California had the most automobile registrations: almost 13.2 million such vehicles were registered in the most populous U.S. federal state. California also had the highest number of registered motor vehicles overall: nearly 30.4 million registrations.
Datasys Automotive Owners dataset includes 80M+ verified U.S. vehicle owners with make, model, and year details, updated quarterly for accuracy.
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the US English Language In-car Speech Dataset, a comprehensive collection of audio recordings designed to facilitate the development of speech recognition models specifically tailored for in-car environments. This dataset aims to support research and innovation in automotive speech technology, enabling seamless and robust voice interactions within vehicles for drivers and co-passengers.
This dataset comprises over 5,000 high-quality audio recordings collected from various in-car environments. These recordings include scripted wake words and command-type prompts.
Participant Diversity:
- Speakers: 50+ native English speakers from the FutureBeeAI Community.
- Regions: Ensures a balanced representation of United States of America1 accents, dialects, and demographics.
- Participant Profile: Participants range from 18 to 70 years old, representing both males and females in a 60:40 ratio, respectively.
Recording Nature: Scripted wake word and command type of audio recordings.
- Duration: Average duration of 5 to 20 seconds per audio recording.
- Formats: WAV format with mono channels, a bit depth of 16 bits. The dataset contains different data at 16kHz and 48kHz.
Apart from participant diversity, the dataset is diverse in terms of different wake words, voice commands, and recording environments.
Different Automobile Related Wake Words: Hey Mercedes, Hey BMW, Hey Porsche, Hey Volvo, Hey Audi, Hi Genesis, Hey Mini, Hey Toyota, Ok Ford, Hey Hyundai, Ok Honda, Hello Kia, Hey Dodge.
Different Cars: Data collection was carried out in different types and models of cars.
Different Types of Voice Commands:
- Navigational Voice Commands
- Mobile Control Voice Commands
- Car Control Voice Commands
- Multimedia & Entertainment Commands
- General, Question Answer, Search Commands
Recording Time: Participants recorded the given prompts at various times to make the dataset more diverse.
- Morning
- Afternoon
- Evening
Recording Environment: Various recording environments were captured to acquire more realistic data and to make the dataset inclusive of various types of noises. Some of the environment variables are as follows:
- Noise Level: Silent, Low Noise, Moderate Noise, High Noise
- Parking Location: Indoor, Outdoor
- Car Windows: Open, Closed
- Car AC: On, Off
- Car Engine: On, Off
- Car Movement: Stationary, Moving
The dataset provides comprehensive metadata for each audio recording and participant:
Participant Metadata: Unique identifier, age, gender, country, state, district, accent, and dialect.
Other Metadata: Recording transcript, recording environment, device details, sample rate, bit depth, file format, recording time.
This metadata is a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of English voice assistant speech recognition models.
This US English In-car audio dataset is created by FutureBeeAI and is available for commercial use.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Car Registrations in the United States increased to 241.10 Thousand in August from 221.50 Thousand in July of 2025. This dataset provides - United States Car Registrations - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This table contains values from Compare.com's proprietary database of car insurance quotes about average DynamicTable.dataset.coverage.monthly_cost_total car insurance costs DynamicTable.dataset.source.stateAvgPrices
This dataset shows the Battery Electric Vehicles (BEVs) and Plug-in Hybrid Electric Vehicles (PHEVs) that are currently registered through Washington State Department of Licensing (DOL).
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Forecast: Ownership of Passenger Cars in the US 2022 - 2026 Discover more data with ReportLinker!
https://www.icpsr.umich.edu/web/ICPSR/studies/35604/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/35604/terms
The United States Census Bureau has conducted surveys of manufacturing activity since 1810 with fluctuating frequency. Between 1919 and 1939 the Census of Manufactures (CM) was conducted biennially. This data collection consists of individual-plant data from the Census of Manufactures for 1929, 1931, 1933, and 1935, the only years in this span for which original returns are available. The records of the Motor Vehicle Industry have been coded to produce an electronic data set to provide the basis for microeconomic evidence for the study of the Great Depression. The data set contains observations on: basic information about the plants (e.g. name, location, owner, etc.), products made and materials used, operation and working hours, employment, wages and salaries, costs and amount of materials used, value and quantity of products by type, and power used.
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the US Spanish Language In-car Speech Dataset, a comprehensive collection of audio recordings designed to facilitate the development of speech recognition models specifically tailored for in-car environments. This dataset aims to support research and innovation in automotive speech technology, enabling seamless and robust voice interactions within vehicles for drivers and co-passengers.
This dataset comprises over 5,000 high-quality audio recordings collected from various in-car environments. These recordings include scripted wake words and command-type prompts.
Participant Diversity:
- Speakers: 50+ native Spanish speakers from the FutureBeeAI Community.
- Regions: Ensures a balanced representation of USA1 accents, dialects, and demographics.
- Participant Profile: Participants range from 18 to 70 years old, representing both males and females in a 60:40 ratio, respectively.
Recording Nature: Scripted wake word and command type of audio recordings.
- Duration: Average duration of 5 to 20 seconds per audio recording.
- Formats: WAV format with mono channels, a bit depth of 16 bits. The dataset contains different data at 16kHz and 48kHz.
Apart from participant diversity, the dataset is diverse in terms of different wake words, voice commands, and recording environments.
Different Automobile Related Wake Words: Hey Mercedes, Hey BMW, Hey Porsche, Hey Volvo, Hey Audi, Hi Genesis, Hey Mini, Hey Toyota, Ok Ford, Hey Hyundai, Ok Honda, Hello Kia, Hey Dodge.
Different Cars: Data collection was carried out in different types and models of cars.
Different Types of Voice Commands:
- Navigational Voice Commands
- Mobile Control Voice Commands
- Car Control Voice Commands
- Multimedia & Entertainment Commands
- General, Question Answer, Search Commands
Recording Time: Participants recorded the given prompts at various times to make the dataset more diverse.
- Morning
- Afternoon
- Evening
Recording Environment: Various recording environments were captured to acquire more realistic data and to make the dataset inclusive of various types of noises. Some of the environment variables are as follows:
- Noise Level: Silent, Low Noise, Moderate Noise, High Noise
- Parking Location: Indoor, Outdoor
- Car Windows: Open, Closed
- Car AC: On, Off
- Car Engine: On, Off
- Car Movement: Stationary, Moving
The dataset provides comprehensive metadata for each audio recording and participant:
Participant Metadata: Unique identifier, age, gender, country, state, district, accent, and dialect.
Other Metadata: Recording transcript, recording environment, device details, sample rate, bit depth, file format, recording time.
This metadata is a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of Spanish voice assistant speech recognition models.
This US Spanish In-car audio dataset is created by FutureBeeAI and is available for commercial use.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Key information about United States Motor Vehicle Sales: Passenger Cars
This dataset contains the file of vehicle, snowmobile and boat registrations in NYS. Registrations expired more than 2 years are excluded. Records that have a scofflaw, revocation and/or suspension are included with indicators specifying those kinds of records.
This layer shows household size by number of vehicles available. This is shown by tract, county, and state centroids. This service is updated annually to contain the most currently released American Community Survey (ACS) 5-year data, and contains estimates and margins of error. There are also additional calculated attributes related to this topic, which can be mapped or used within analysis. This layer is symbolized to show the count and percentage of households with no vehicle available. To see the full list of attributes available in this service, go to the "Data" tab, and choose "Fields" at the top right. Current Vintage: 2019-2023ACS Table(s): B08201 Data downloaded from: Census Bureau's API for American Community Survey Date of API call: December 12, 2024National Figures: data.census.govThe United States Census Bureau's American Community Survey (ACS):About the SurveyGeography & ACSTechnical DocumentationNews & UpdatesThis ready-to-use layer can be used within ArcGIS Pro, ArcGIS Online, its configurable apps, dashboards, Story Maps, custom apps, and mobile apps. Data can also be exported for offline workflows. For more information about ACS layers, visit the FAQ. Please cite the Census and ACS when using this data.Data Note from the Census:Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see Accuracy of the Data). The effect of nonsampling error is not represented in these tables.Data Processing Notes:This layer is updated automatically when the most current vintage of ACS data is released each year, usually in December. The layer always contains the latest available ACS 5-year estimates. It is updated annually within days of the Census Bureau's release schedule. Click here to learn more about ACS data releases.Boundaries come from the US Census TIGER geodatabases, specifically, the National Sub-State Geography Database (named tlgdb_(year)_a_us_substategeo.gdb). Boundaries are updated at the same time as the data updates (annually), and the boundary vintage appropriately matches the data vintage as specified by the Census. These are Census boundaries with water and/or coastlines erased for cartographic and mapping purposes. For census tracts, the water cutouts are derived from a subset of the 2020 Areal Hydrography boundaries offered by TIGER. Water bodies and rivers which are 50 million square meters or larger (mid to large sized water bodies) are erased from the tract level boundaries, as well as additional important features. For state and county boundaries, the water and coastlines are derived from the coastlines of the 2023 500k TIGER Cartographic Boundary Shapefiles. These are erased to more accurately portray the coastlines and Great Lakes. The original AWATER and ALAND fields are still available as attributes within the data table (units are square meters).The States layer contains 52 records - all US states, Washington D.C., and Puerto RicoCensus tracts with no population that occur in areas of water, such as oceans, are removed from this data service (Census Tracts beginning with 99).Percentages and derived counts, and associated margins of error, are calculated values (that can be identified by the "_calc_" stub in the field name), and abide by the specifications defined by the American Community Survey.Field alias names were created based on the Table Shells file available from the American Community Survey Summary File Documentation page.Negative values (e.g., -4444...) have been set to null, with the exception of -5555... which has been set to zero. These negative values exist in the raw API data to indicate the following situations:The margin of error column indicates that either no sample observations or too few sample observations were available to compute a standard error and thus the margin of error. A statistical test is not appropriate.Either no sample observations or too few sample observations were available to compute an estimate, or a ratio of medians cannot be calculated because one or both of the median estimates falls in the lowest interval or upper interval of an open-ended distribution.The median falls in the lowest interval of an open-ended distribution, or in the upper interval of an open-ended distribution. A statistical test is not appropriate.The estimate is controlled. A statistical test for sampling variability is not appropriate.The data for this geographic area cannot be displayed because the number of sample cases is too small.
Sales of used light vehicles in the United States came to around **** million units in 2024. In the same period, approximately **** million new light trucks and automobiles were sold here. Declining availability of vehicles In the fourth quarter of 2024, about ***** million vehicles were in operation in the United States, an increase of around *** percent year-over-year. The rising demand for vehicles paired with an overall price inflation lead to a rise in new vehicle prices. In contrast, used vehicle prices slightly decreased. E-commerce: a solution for the bumpy road ahead? Financial reports have revealed how the outbreak of the coronavirus pandemic has triggered a shift in vehicle-buying behavior. With many consumer goods and services now bought online due to COVID-19, the automobile industry has also started to digitally integrate its services online to reach consumers with a preference for contactless test driving amid the global crisis. Several dealers and automobile companies had already begun to tap into online car sales before the pandemic, some of them being Carvana and Tesla.
Gambit's Automotive Vehicle Ownership data set consists of high resolution ownership data including VIN, name, phone, email, make, model, year, color, and mileage for over 10 million unique records within the U.S. Records cover 5 years of historical data and are updated quarterly.
We recognize that every business has unique needs for data applications and frequently find pre-built data sets too bulky, too expensive, or out of scope. If your application requires additional attributes or features not already included, Gambit also offers affordable alternative solutions through custom data sets that are optimized to meet your application's needs.
This data is aggregated via publicly available sources using proprietary methodologies.
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the Canadian French Language In-car Speech Dataset, a comprehensive collection of audio recordings designed to facilitate the development of speech recognition models specifically tailored for in-car environments. This dataset aims to support research and innovation in automotive speech technology, enabling seamless and robust voice interactions within vehicles for drivers and co-passengers.
This dataset comprises over 5,000 high-quality audio recordings collected from various in-car environments. These recordings include scripted wake words and command-type prompts.
Participant Diversity:
- Speakers: 50+ native French speakers from the FutureBeeAI Community.
- Regions: Ensures a balanced representation of Canada1 accents, dialects, and demographics.
- Participant Profile: Participants range from 18 to 70 years old, representing both males and females in a 60:40 ratio, respectively.
Recording Nature: Scripted wake word and command type of audio recordings.
- Duration: Average duration of 5 to 20 seconds per audio recording.
- Formats: WAV format with mono channels, a bit depth of 16 bits. The dataset contains different data at 16kHz and 48kHz.
Apart from participant diversity, the dataset is diverse in terms of different wake words, voice commands, and recording environments.
Different Automobile Related Wake Words: Hey Mercedes, Hey BMW, Hey Porsche, Hey Volvo, Hey Audi, Hi Genesis, Hey Mini, Hey Toyota, Ok Ford, Hey Hyundai, Ok Honda, Hello Kia, Hey Dodge.
Different Cars: Data collection was carried out in different types and models of cars.
Different Types of Voice Commands:
- Navigational Voice Commands
- Mobile Control Voice Commands
- Car Control Voice Commands
- Multimedia & Entertainment Commands
- General, Question Answer, Search Commands
Recording Time: Participants recorded the given prompts at various times to make the dataset more diverse.
- Morning
- Afternoon
- Evening
Recording Environment: Various recording environments were captured to acquire more realistic data and to make the dataset inclusive of various types of noises. Some of the environment variables are as follows:
- Noise Level: Silent, Low Noise, Moderate Noise, High Noise
- Parking Location: Indoor, Outdoor
- Car Windows: Open, Closed
- Car AC: On, Off
- Car Engine: On, Off
- Car Movement: Stationary, Moving
The dataset provides comprehensive metadata for each audio recording and participant:
Participant Metadata: Unique identifier, age, gender, country, state, district, accent, and dialect.
Other Metadata: Recording transcript, recording environment, device details, sample rate, bit depth, file format, recording time.
This metadata is a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of French voice assistant speech recognition models.
This Canadian French In-car audio dataset is created by FutureBeeAI and is available for commercial use.
In 2024, the auto industry in the United States sold approximately 15.9 million light vehicle units. This figure includes retail sales of about three million passenger cars and just under 12.9 million light trucks. Lower fuel consumption There are many kinds of light vehicles available in the United States. Light-duty vehicles are popular for their utility and improved fuel economy, making them an ideal choice for savvy consumers. As of Model Year 2023, the light vehicle manufacturer with the best overall miles per gallon was Kia, with one gallon of gas allowing for 30.4 miles on the road. Higher brand satisfaction When asked about light vehicle satisfaction, consumers in the United States were most satisfied with Toyota, Subaru, Tesla, and Mercedes-Benz models. Another survey conducted in 2018 and quizzing respondents on their stance regarding the leading car brands indicated that Lexus was among the most dependable brands based on the number of problems reported per 100 vehicles.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Operating-Income Time Series for Hagerty Inc. Hagerty, Inc. provides insurance services for collector cars and enthusiast vehicles in the United States. The company engages in the underwriting, selling, and servicing collector car and enthusiast vehicle insurance policies. It also provides Hagerty Drivers Club memberships, which are primarily bundled with insurance policies and give subscribers access to an array of products and services, such as emergency roadside assistance, hagerty drivers club magazine, automotive enthusiast events, proprietary vehicle valuation tool, and special vehicle related discounts. In addition, the company offers marketplace to complement insurance and membership offerings. Hagerty, Inc. is headquartered in Traverse City, Michigan. Hagerty, Inc. is a subsidiary of Hagerty Holding Corp.
Datasys Automotive Demographics provides a detailed view of 60M+ U.S. vehicle owners enriched with age, income, household, lifestyle, and geographic attributes. This dataset enables marketers to understand the people behind the vehicles, build more relevant campaigns, and target based on ownership plus demographic context. It is valuable for insurers, auto dealers, and service providers aiming to reach specific consumer groups with tailored offers.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Defensive-Intrval-Ratio Time Series for Hagerty Inc. Hagerty, Inc. provides insurance services for collector cars and enthusiast vehicles in the United States. The company engages in the underwriting, selling, and servicing collector car and enthusiast vehicle insurance policies. It also provides Hagerty Drivers Club memberships, which are primarily bundled with insurance policies and give subscribers access to an array of products and services, such as emergency roadside assistance, hagerty drivers club magazine, automotive enthusiast events, proprietary vehicle valuation tool, and special vehicle related discounts. In addition, the company offers marketplace to complement insurance and membership offerings. Hagerty, Inc. is headquartered in Traverse City, Michigan.
Alesco Data's Automotive records are updated monthly from millions of proprietary sourced vehicle transactions. These incoming transactions are processed through compilation rules and are either added as new, incremental records to our file, or contribute to validating existing records.
Our recent focus is on compiling new vehicle ownership, and the file includes over 14.2 million late model vehicle owners (2020-2025).
In addition, we append our Persistent ID, telephone numbers, and demographics for a complete file that can support your direct mail and email marketing, lead validation, and identity verification needs. A Persistent ID is assigned to each vehicle record and tracks consumers as they change addresses or phone numbers, and vehicles as they change owners.
The database is not derived from state motor vehicle databases and therefore not subject to the Shelby Act also known as the Driver's Privacy Protection Act (DPPA) of 2000. The data is deterministic and sources include sales and service data, warranty data and notifications, aftermarket repair and maintenance facilities, and scheduled maintenance records.
Fields Included: -Make -Model -Year -VIN -Vehicle Class Code (crossover, SUV, full-size, mid-size, small) -Vehicle Fuel Code (gas, flex, hybrid) -Vehicle Style Code (sport, pickup, utility, sedan) -Mileage -Number of Vehicles per Household -First seen date -Last seen date -Email