15 datasets found
  1. Billionaires dataset cleaned

    • kaggle.com
    zip
    Updated Feb 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Javier_SAB (2024). Billionaires dataset cleaned [Dataset]. https://www.kaggle.com/datasets/javiersab/billionaires-dataset-cleaned
    Explore at:
    zip(128906 bytes)Available download formats
    Dataset updated
    Feb 24, 2024
    Authors
    Javier_SAB
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Cleaned dataset from the Billionaires Statistic Dataset (2023) that can be found here.

    The code I used to clean and re-structure the data is also here.

    First things first: a big shout-out to Nidula Elgiriyewithana for providing the original data.

    As with it, this dataset contains various information about the world's wealthiest persons in different columns that can be grouped into three different types:

    • Business-related information. These columns contain data about the industry in which the billionaires' operate, their source of wealth, total wealth and position they occupy in the ranking.
    • Personal information. Such as name, age, nationality, country and city of residence.
    • Economic activity information. These columns are related to the country in which the billionaire resides and provide different economic indicators like GDP, education enrollment or Consumer Price Index (CPI).

    Column names

    • position. Ranking of the billionaire measured by their wealth.
    • wealth. The wealth of the billionaire measured in $.
    • industry. Industry in which the billionaire's operates their businesses.
    • full_name. Complete name of the billionaire.
    • age. The age of the billionaire.
    • country_of_residence. Country in which the billionaire resides.
    • city_of_residence. City in which the billionaire resides.
    • source. The source of the billionaire's wealth.
    • citizenship. The country of citizenship of the billionaire.
    • gender. The gender of the billionaire.
    • birth_date. The birth date of the billionaire.
    • last_name. The last name of the billionaire.
    • first_name. The first name of the billionaire.
    • residence_state. State in which the billionaire resides (only for billionaires who reside in the U.S.).
    • residence_region. Region in which the billionaire resides (only for billionaires who reside in the U.S.).
    • birth_year. The birth year of the billionaire.
    • birth_month. The birth month of the billionaire.
    • birth_day. The birth data of the billionaire.
    • cpi_country. Consumer Price Index (CPI) for the billionaire's country.
    • cpi_change_country. CPI change for the billionaire's country.
    • gdp_country. Gross Domestic Product (GDP) in $ for the billionaire's country.
    • g_tertiary_ed_enroll. Enrollment in tertiary education in the billionaire's country.
    • g_primary_ed_enroll. Enrollment in primary education in the billionaire's country.
    • life_expectancy. Life expectancy in the billionaire's country.
    • tax_revenue. Tax revenue in the billionaire's country.
    • tax_rate. Total tax rate in the billionaire's country.
    • country_pop. Population of the billionaire's country.
    • country_lat. Latitude coordinate of the billionaire's country.
    • country_long. Longitude coordinate of the billionaire's country.
    • continent. Continent in which the country of the billionaire's residence is located.

    Potential analyses

    • Analyze which industries contain the biggest groups of billionaires overall and in different countries.
    • Explore number of billionaires and total wealth across countries and continents and display the result in a map.
    • Focus on personal information columns such as age or gender to explore the distribution of billionaires from this perspective.
    • Discover if countries' economic indicators have any impact in the presence of billionaires.
    • The U.S. is the country with most billionaires presented in the dataset and also the only one with attributes in the residence_state and residence_region columns. This makes the American billionaires a good focus for a specific analysis.

    Bonus

    If you want a challenge, you can create a dashboard using tools such as Plotly to dynamically visualize the data using one or different attributes (such as industry, age or country). I did it, leave the link below in case you want to investigate:

    Dashboard notebook here


    If you find this dataset informative or inspirational, a vote is appreciated for others to easily discover value in it ๐Ÿ’Ž๐Ÿ’ฐ

  2. h

    100-richest-people-in-world

    • huggingface.co
    Updated Aug 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nate Raw (2023). 100-richest-people-in-world [Dataset]. https://huggingface.co/datasets/nateraw/100-richest-people-in-world
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 2, 2023
    Authors
    Nate Raw
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Area covered
    World
    Description

    Dataset Card for 100 Richest People In World

      Dataset Summary
    

    This dataset contains the list of Top 100 Richest People in the World Column Information:-

    Name - Person Name NetWorth - His/Her Networth Age - Person Age Country - The country person belongs to Source - Information Source Industry - Expertise Domain

      Join our Community
    
    
    
    
    
    
    
    
    
      Supported Tasks and Leaderboards
    

    [More Information Needed]

      Languages
    

    [More Information Needed]โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/nateraw/100-richest-people-in-world.

  3. T

    GDP by Country in AMERICA

    • tradingeconomics.com
    csv, excel, json, xml
    Updated May 30, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2017). GDP by Country in AMERICA [Dataset]. https://tradingeconomics.com/country-list/gdp?continent=america
    Explore at:
    csv, excel, xml, jsonAvailable download formats
    Dataset updated
    May 30, 2017
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    United States
    Description

    This dataset provides values for GDP reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.

  4. 1000 Richest People in the World

    • kaggle.com
    zip
    Updated Jul 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Waqar Ali (2024). 1000 Richest People in the World [Dataset]. https://www.kaggle.com/datasets/waqi786/1000-richest-people-in-the-world
    Explore at:
    zip(8652 bytes)Available download formats
    Dataset updated
    Jul 28, 2024
    Authors
    Waqar Ali
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset provides a synthetic overview of the 1,000 wealthiest individuals in the world, offering insights into the distribution of wealth across industries and regions. It is designed to help analysts, researchers, and data enthusiasts explore global wealth trends, industry dominance, and regional wealth concentration.

    Whether you're conducting market research, financial analysis, or data modeling, this dataset serves as a valuable resource for understanding the characteristics of the world's top billionaires.

    ๐Ÿ“Š Key Features: Name ๐Ÿ‘ค: The name of the billionaire. Country ๐ŸŒ: Country of residence or primary business operation. Industry ๐Ÿญ: Industry in which the individual has built their wealth. Net Worth (in billions) ๐Ÿ’ต: Estimated net worth in billions of USD. Company ๐Ÿข: The primary company or business associated with the billionaire. โš ๏ธ Important Note: This dataset is 100% synthetic and does not contain real financial or personal data. It is artificially generated for educational, analytical, and research purposes.

  5. U

    United States US: Account at a Financial Institution: Income: Richest 60%: %...

    • ceicdata.com
    Updated Mar 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2023). United States US: Account at a Financial Institution: Income: Richest 60%: % Aged 15+ [Dataset]. https://www.ceicdata.com/en/united-states/banking-indicators/us-account-at-a-financial-institution-income-richest-60--aged-15
    Explore at:
    Dataset updated
    Mar 15, 2023
    Dataset provided by
    CEICdata.com
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Variables measured
    undefined
    Description

    United States US: Account at a Financial Institution: Income: Richest 60%: % Aged 15+ data was reported at 97.904 % in 2014. This records an increase from the previous number of 92.810 % for 2011. United States US: Account at a Financial Institution: Income: Richest 60%: % Aged 15+ data is updated yearly, averaging 95.357 % from Dec 2011 (Median) to 2014, with 2 observations. The data reached an all-time high of 97.904 % in 2014 and a record low of 92.810 % in 2011. United States US: Account at a Financial Institution: Income: Richest 60%: % Aged 15+ data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Databaseโ€™s United States โ€“ Table US.World Bank.WDI: Banking Indicators. Account at a financial institution denotes the percentage of respondents who report having an account (by themselves or together with someone else) at a bank or another type of financial institution.; ; Demirguc-Kunt et al., 2015, Global Financial Inclusion Database, World Bank.; Weighted average;

  6. Spanish Language Datasets | 1.8M+ Sentences | Translation Data | TTS |...

    • datarade.ai
    Updated Jul 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oxford Languages (2025). Spanish Language Datasets | 1.8M+ Sentences | Translation Data | TTS | Dictionary Display | Translations | EU & LATAM Coverage [Dataset]. https://datarade.ai/data-products/spanish-language-datasets-1-8m-sentences-nlp-tts-dic-oxford-languages
    Explore at:
    .json, .xml, .csv, .xls, .txt, .mp3, .wavAvailable download formats
    Dataset updated
    Jul 11, 2025
    Dataset authored and provided by
    Oxford Languageshttps://lexico.com/es
    Area covered
    Nicaragua, Honduras, Chile, Bolivia (Plurinational State of), Costa Rica, Ecuador, Cuba, Panama, Paraguay, Colombia
    Description

    Linguistically annotated Spanish language datasets with headwords, definitions, senses, examples, POS tags, semantic metadata, and usage info. Ideal for dictionary tools, NLP, and TTS model training or fine-tuning.

    Our Spanish language datasets are carefully compiled and annotated by language and linguistic experts; you can find them available for licensing:

    1. Spanish Monolingual Dictionary Data
    2. Spanish Bilingual Dictionary Data
    3. Spanish Sentences Data
    4. Synonyms and Antonyms Data
    5. Audio Data
    6. Spanish Word List Data

    Key Features (approximate numbers):

    1. Spanish Monolingual Dictionary Data

    Our Spanish monolingual reliably offers clear definitions and examples, a large volume of headwords, and comprehensive coverage of the Spanish language.

    • Words: 73,000
    • Senses: 123,000
    • Example sentences: 104,000
    • Format: XML and JSON formats
    • Delivery: Email (link-based file sharing) and REST API
    • Updated frequency: annually
    1. Spanish Bilingual Dictionary Data

    The bilingual data provides translations in both directions, from English to Spanish and from Spanish to English. It is annually reviewed and updated by our in-house team of language experts. Offers significant coverage of the language, providing a large volume of translated words of excellent quality.

    • Translations: 221,300
    • Senses: 103,500
    • Example sentences: 74,500
    • Example translations: 83,800
    • Format: XML and JSON formats
    • Delivery: Email (link-based file sharing) and REST API
    • Updated frequency: annually
    1. Spanish Sentences Data

    Spanish sentences retrieved from the corpus are ideal for NLP model training, presenting approximately 20 million words. The sentences provide a great coverage of Spanish-speaking countries and are accordingly tagged to a particular country or dialect.

    • Sentences volume: 1,840,000
    • Format: XML and JSON format
    • Delivery: Email (link-based file sharing) and REST API
    1. Spanish Synonyms and Antonyms Data

    This Spanish language dataset offers a rich collection of synonyms and antonyms, accompanied by detailed definitions and part-of-speech (POS) annotations, making it a comprehensive resource for building linguistically aware AI systems and language technologies.

    • Synonyms: 127,700
    • Antonyms: 9,500
    • Format: XML format
    • Delivery: Email (link-based file sharing)
    • Updated frequency: annually
    1. Spanish Audio Data (word-level)

    Curated word-level audio data for the Spanish language, which covers all varieties of world Spanish, providing rich dialectal diversity in the Spanish language.

    • Audio files: 20,900
    • Format: XLSX (for index), MP3 and WAV (audio files)
    1. Spanish Word List Data

    This language data contains a carefully curated and comprehensive list of 450,000 Spanish words.

    • Wordforms: 450,000
    • Format: CSV and TXT formats
    • Delivery: Email (link-based file sharing)

    Use Cases:

    We consistently work with our clients on new use cases as language technology continues to evolve. These include NLP applications, TTS, dictionary display tools, games, translation, word embedding, and word sense disambiguation (WSD).

    If you have a specific use case in mind that isn't listed here, weโ€™d be happy to explore it with you. Donโ€™t hesitate to get in touch with us at Oxford.Languages@oup.com to start the conversation.

    Pricing:

    Oxford Languages offers flexible pricing based on use case and delivery format. Our datasets are licensed via term-based IP agreements and tiered pricing for API-delivered data. Whether youโ€™re integrating into a product, training an LLM, or building custom NLP solutions, we tailor licensing to your specific needs.

    Contact our team or email us at Oxford.Languages@oup.com to explore pricing options and discover how our language data can support your goals.

    About the sample:

    The samples offer a brief overview of one or two language datasets (monolingual or/and bilingual dictionary data). To help you explore the structure and features of our dataset, we provide a sample in CSV format for preview purposes only.

    If you need the complete original sample or more details about any dataset, please contact us (Growth.OL@oup.com) to request access or further information

  7. Leading billionaires worldwide 2025

    • statista.com
    Updated Mar 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Leading billionaires worldwide 2025 [Dataset]. https://www.statista.com/statistics/272047/top-25-global-billionaires/
    Explore at:
    Dataset updated
    Mar 18, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Mar 2025
    Area covered
    World
    Description

    As of March 2025, Elon Musk had a net worth valued at 328.5 billion U.S. dollars, making him the richest man in the world. Amazon founder Jeff Bezos followed in second, with Marc Zuckerberg, the founder of Facebook, in third. The list is dominated by Americans, and Alice Walton and Francoise Bettencourt Meyers are the only women among the 20 richest people worldwide.

  8. T

    GOLD RESERVES by Country Dataset

    • tradingeconomics.com
    csv, excel, json, xml
    Updated May 26, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2017). GOLD RESERVES by Country Dataset [Dataset]. https://tradingeconomics.com/country-list/gold-reserves
    Explore at:
    excel, xml, csv, jsonAvailable download formats
    Dataset updated
    May 26, 2017
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    World
    Description

    This dataset provides values for GOLD RESERVES reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.

  9. n

    20 Richest Counties in New York

    • newyork-demographics.com
    Updated Jun 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kristen Carney (2024). 20 Richest Counties in New York [Dataset]. https://www.newyork-demographics.com/richest_counties
    Explore at:
    Dataset updated
    Jun 20, 2024
    Dataset provided by
    Cubit Planning, Inc.
    Authors
    Kristen Carney
    License

    https://www.newyork-demographics.com/terms_and_conditionshttps://www.newyork-demographics.com/terms_and_conditions

    Area covered
    New York
    Description

    A dataset listing the 20 richest counties in New York for 2024, including information on rank, county, population, average income, and median income.

  10. m

    20 Richest Counties in Michigan

    • michigan-demographics.com
    Updated Jun 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kristen Carney (2024). 20 Richest Counties in Michigan [Dataset]. https://www.michigan-demographics.com/richest_counties
    Explore at:
    Dataset updated
    Jun 20, 2024
    Dataset provided by
    Cubit Planning, Inc.
    Authors
    Kristen Carney
    License

    https://www.michigan-demographics.com/terms_and_conditionshttps://www.michigan-demographics.com/terms_and_conditions

    Area covered
    Michigan
    Description

    A dataset listing the 20 richest counties in Michigan for 2024, including information on rank, county, population, average income, and median income.

  11. f

    20 Richest Counties in Florida

    • florida-demographics.com
    Updated Jun 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kristen Carney (2024). 20 Richest Counties in Florida [Dataset]. https://www.florida-demographics.com/counties_by_population
    Explore at:
    Dataset updated
    Jun 20, 2024
    Dataset provided by
    Cubit Planning, Inc.
    Authors
    Kristen Carney
    License

    https://www.florida-demographics.com/terms_and_conditionshttps://www.florida-demographics.com/terms_and_conditions

    Area covered
    Florida
    Description

    A dataset listing Florida counties by population for 2024.

  12. g

    20 Richest Counties in Georgia

    • georgia-demographics.com
    Updated Jun 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kristen Carney (2024). 20 Richest Counties in Georgia [Dataset]. https://www.georgia-demographics.com/counties_by_population
    Explore at:
    Dataset updated
    Jun 20, 2024
    Dataset provided by
    Cubit Planning, Inc.
    Authors
    Kristen Carney
    License

    https://www.georgia-demographics.com/terms_and_conditionshttps://www.georgia-demographics.com/terms_and_conditions

    Area covered
    Georgia
    Description

    A dataset listing Georgia counties by population for 2024.

  13. i

    20 Richest Counties in Illinois

    • illinois-demographics.com
    Updated Jun 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kristen Carney (2024). 20 Richest Counties in Illinois [Dataset]. https://www.illinois-demographics.com/richest_counties
    Explore at:
    Dataset updated
    Jun 20, 2024
    Dataset provided by
    Cubit Planning, Inc.
    Authors
    Kristen Carney
    License

    https://www.illinois-demographics.com/terms_and_conditionshttps://www.illinois-demographics.com/terms_and_conditions

    Area covered
    Illinois
    Description

    A dataset listing the 20 richest counties in Illinois for 2024, including information on rank, county, population, average income, and median income.

  14. N

    Rich Square, NC Population Breakdown By Race (Excluding Ethnicity) Dataset:...

    • neilsberg.com
    csv, json
    Updated Feb 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Rich Square, NC Population Breakdown By Race (Excluding Ethnicity) Dataset: Population Counts and Percentages for 7 Racial Categories as Identified by the US Census Bureau // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/rich-square-nc-population-by-race/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 21, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Rich Square, North Carolina
    Variables measured
    Asian Population, Black Population, White Population, Some other race Population, Two or more races Population, American Indian and Alaska Native Population, Asian Population as Percent of Total Population, Black Population as Percent of Total Population, White Population as Percent of Total Population, Native Hawaiian and Other Pacific Islander Population, and 4 more
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the racial categories idetified by the US Census Bureau. It is ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories, and do not rely on any ethnicity classification. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Rich Square by race. It includes the population of Rich Square across racial categories (excluding ethnicity) as identified by the Census Bureau. The dataset can be utilized to understand the population distribution of Rich Square across relevant racial categories.

    Key observations

    The percent distribution of Rich Square population by race (across all racial categories recognized by the U.S. Census Bureau): 39.49% are white, 53.21% are Black or African American, 1.54% are Asian, 1.67% are some other race and 4.10% are multiracial.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Racial categories include:

    • White
    • Black or African American
    • American Indian and Alaska Native
    • Asian
    • Native Hawaiian and Other Pacific Islander
    • Some other race
    • Two or more races (multiracial)

    Variables / Data Columns

    • Race: This column displays the racial categories (excluding ethnicity) for the Rich Square
    • Population: The population of the racial category (excluding ethnicity) in the Rich Square is shown in this column.
    • % of Total Population: This column displays the percentage distribution of each race as a proportion of Rich Square total population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Rich Square Population by Race & Ethnicity. You can refer the same here

  15. Olympics-Dataset

    • kaggle.com
    zip
    Updated Oct 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammad Kaif Tahir (2023). Olympics-Dataset [Dataset]. https://www.kaggle.com/datasets/mohammadkaiftahir/olympics-dataset
    Explore at:
    zip(5690772 bytes)Available download formats
    Dataset updated
    Oct 14, 2023
    Authors
    Mohammad Kaif Tahir
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Athlete Events.csv

    Columns name

    1. ID: - Description: A unique identifier for each individual. - Example Values: 123, 456, 789, etc.

    2. Name: - Description: The name of the individual. - Example Values: John Doe, Jane Smith, Michael Johnson, etc.

    3.Sex: - Description: The gender of the individual. - Example Values: Male, Female.

    4.Age: - Description: The age of the individual. - Example Values: 25, 30, 22, etc.

    5.Height: - Description: The height of the individual. - Example Values: 175 cm, 160 cm, 185 cm, etc.

    6.Weight: - Description: The weight of the individual. - Example Values: 70 kg, 55 kg, 80 kg, etc.

    7.Team: - Description: The sports team the individual is associated with. - Example Values: Team USA, Team Canada, Team Germany, etc.

    8.NOC: - Description: The National Olympic Committee code representing the country. - Example Values: USA, CAN, GER, etc.

    9.Games: - Description: The specific Olympic Games event. - Example Values: Summer Olympics, Winter Olympics.

    10.Year: - Description: The year of the Olympic Games. - Example Values: 2000, 2012, 2016, etc.

    11.Season: - Description: The season of the Olympic Games. - Example Values: Summer, Winter.

    12.City: - Description: The city where the Olympic Games were held. - Example Values: Sydney, Beijing, Tokyo, etc.

    13.Sport: - Description: The sport in which the individual participated. - Example Values: Swimming, Athletics, Gymnastics, etc.

    14.Event: - Description: The specific event within the sport. - Example Values: 100m Freestyle, Long Jump, Parallel Bars, etc.

    15.Medal: - Description: The type of medal won by the individual in the event. - Example Values: Gold, Silver, Bronze, or blank if no medal was won.

    noc_regions.csv

    Columns name

    1.NOC: - Description: The National Olympic Committee code representing the country. - Example Values: USA, CAN, GER, etc.

    2.Region: - Description: The geographical region associated with the National Olympic Committee. - Example Values: North America, Europe, Asia, etc.

    3.Notes: - Description: Additional notes or information related to the National Olympic Committee or region. - Example Values: Historical context, special considerations, etc.

    This table seems to provide information about the National Olympic Committees, their associated regions, and any additional notes that might be relevant. The "NOC" column serves as a key to potentially link information between this table and the previous one you mentioned.

  16. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Javier_SAB (2024). Billionaires dataset cleaned [Dataset]. https://www.kaggle.com/datasets/javiersab/billionaires-dataset-cleaned
Organization logo

Billionaires dataset cleaned

ready for Exploratory Data Analysis and Modeling

Explore at:
zip(128906 bytes)Available download formats
Dataset updated
Feb 24, 2024
Authors
Javier_SAB
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Cleaned dataset from the Billionaires Statistic Dataset (2023) that can be found here.

The code I used to clean and re-structure the data is also here.

First things first: a big shout-out to Nidula Elgiriyewithana for providing the original data.

As with it, this dataset contains various information about the world's wealthiest persons in different columns that can be grouped into three different types:

  • Business-related information. These columns contain data about the industry in which the billionaires' operate, their source of wealth, total wealth and position they occupy in the ranking.
  • Personal information. Such as name, age, nationality, country and city of residence.
  • Economic activity information. These columns are related to the country in which the billionaire resides and provide different economic indicators like GDP, education enrollment or Consumer Price Index (CPI).

Column names

  • position. Ranking of the billionaire measured by their wealth.
  • wealth. The wealth of the billionaire measured in $.
  • industry. Industry in which the billionaire's operates their businesses.
  • full_name. Complete name of the billionaire.
  • age. The age of the billionaire.
  • country_of_residence. Country in which the billionaire resides.
  • city_of_residence. City in which the billionaire resides.
  • source. The source of the billionaire's wealth.
  • citizenship. The country of citizenship of the billionaire.
  • gender. The gender of the billionaire.
  • birth_date. The birth date of the billionaire.
  • last_name. The last name of the billionaire.
  • first_name. The first name of the billionaire.
  • residence_state. State in which the billionaire resides (only for billionaires who reside in the U.S.).
  • residence_region. Region in which the billionaire resides (only for billionaires who reside in the U.S.).
  • birth_year. The birth year of the billionaire.
  • birth_month. The birth month of the billionaire.
  • birth_day. The birth data of the billionaire.
  • cpi_country. Consumer Price Index (CPI) for the billionaire's country.
  • cpi_change_country. CPI change for the billionaire's country.
  • gdp_country. Gross Domestic Product (GDP) in $ for the billionaire's country.
  • g_tertiary_ed_enroll. Enrollment in tertiary education in the billionaire's country.
  • g_primary_ed_enroll. Enrollment in primary education in the billionaire's country.
  • life_expectancy. Life expectancy in the billionaire's country.
  • tax_revenue. Tax revenue in the billionaire's country.
  • tax_rate. Total tax rate in the billionaire's country.
  • country_pop. Population of the billionaire's country.
  • country_lat. Latitude coordinate of the billionaire's country.
  • country_long. Longitude coordinate of the billionaire's country.
  • continent. Continent in which the country of the billionaire's residence is located.

Potential analyses

  • Analyze which industries contain the biggest groups of billionaires overall and in different countries.
  • Explore number of billionaires and total wealth across countries and continents and display the result in a map.
  • Focus on personal information columns such as age or gender to explore the distribution of billionaires from this perspective.
  • Discover if countries' economic indicators have any impact in the presence of billionaires.
  • The U.S. is the country with most billionaires presented in the dataset and also the only one with attributes in the residence_state and residence_region columns. This makes the American billionaires a good focus for a specific analysis.

Bonus

If you want a challenge, you can create a dashboard using tools such as Plotly to dynamically visualize the data using one or different attributes (such as industry, age or country). I did it, leave the link below in case you want to investigate:

Dashboard notebook here


If you find this dataset informative or inspirational, a vote is appreciated for others to easily discover value in it ๐Ÿ’Ž๐Ÿ’ฐ

Search
Clear search
Close search
Google apps
Main menu