Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Cleaned dataset from the Billionaires Statistic Dataset (2023) that can be found here.
The code I used to clean and re-structure the data is also here.
First things first: a big shout-out to Nidula Elgiriyewithana for providing the original data.
As with it, this dataset contains various information about the world's wealthiest persons in different columns that can be grouped into three different types:
If you want a challenge, you can create a dashboard using tools such as Plotly to dynamically visualize the data using one or different attributes (such as industry, age or country). I did it, leave the link below in case you want to investigate:
If you find this dataset informative or inspirational, a vote is appreciated for others to easily discover value in it ๐๐ฐ
Facebook
Twitterhttps://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Dataset Card for 100 Richest People In World
Dataset Summary
This dataset contains the list of Top 100 Richest People in the World Column Information:-
Name - Person Name NetWorth - His/Her Networth Age - Person Age Country - The country person belongs to Source - Information Source Industry - Expertise Domain
Join our Community
Supported Tasks and Leaderboards
[More Information Needed]
Languages
[More Information Needed]โฆ See the full description on the dataset page: https://huggingface.co/datasets/nateraw/100-richest-people-in-world.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for GDP reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset provides a synthetic overview of the 1,000 wealthiest individuals in the world, offering insights into the distribution of wealth across industries and regions. It is designed to help analysts, researchers, and data enthusiasts explore global wealth trends, industry dominance, and regional wealth concentration.
Whether you're conducting market research, financial analysis, or data modeling, this dataset serves as a valuable resource for understanding the characteristics of the world's top billionaires.
๐ Key Features: Name ๐ค: The name of the billionaire. Country ๐: Country of residence or primary business operation. Industry ๐ญ: Industry in which the individual has built their wealth. Net Worth (in billions) ๐ต: Estimated net worth in billions of USD. Company ๐ข: The primary company or business associated with the billionaire. โ ๏ธ Important Note: This dataset is 100% synthetic and does not contain real financial or personal data. It is artificially generated for educational, analytical, and research purposes.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States US: Account at a Financial Institution: Income: Richest 60%: % Aged 15+ data was reported at 97.904 % in 2014. This records an increase from the previous number of 92.810 % for 2011. United States US: Account at a Financial Institution: Income: Richest 60%: % Aged 15+ data is updated yearly, averaging 95.357 % from Dec 2011 (Median) to 2014, with 2 observations. The data reached an all-time high of 97.904 % in 2014 and a record low of 92.810 % in 2011. United States US: Account at a Financial Institution: Income: Richest 60%: % Aged 15+ data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Databaseโs United States โ Table US.World Bank.WDI: Banking Indicators. Account at a financial institution denotes the percentage of respondents who report having an account (by themselves or together with someone else) at a bank or another type of financial institution.; ; Demirguc-Kunt et al., 2015, Global Financial Inclusion Database, World Bank.; Weighted average;
Facebook
TwitterLinguistically annotated Spanish language datasets with headwords, definitions, senses, examples, POS tags, semantic metadata, and usage info. Ideal for dictionary tools, NLP, and TTS model training or fine-tuning.
Our Spanish language datasets are carefully compiled and annotated by language and linguistic experts; you can find them available for licensing:
Key Features (approximate numbers):
Our Spanish monolingual reliably offers clear definitions and examples, a large volume of headwords, and comprehensive coverage of the Spanish language.
The bilingual data provides translations in both directions, from English to Spanish and from Spanish to English. It is annually reviewed and updated by our in-house team of language experts. Offers significant coverage of the language, providing a large volume of translated words of excellent quality.
Spanish sentences retrieved from the corpus are ideal for NLP model training, presenting approximately 20 million words. The sentences provide a great coverage of Spanish-speaking countries and are accordingly tagged to a particular country or dialect.
This Spanish language dataset offers a rich collection of synonyms and antonyms, accompanied by detailed definitions and part-of-speech (POS) annotations, making it a comprehensive resource for building linguistically aware AI systems and language technologies.
Curated word-level audio data for the Spanish language, which covers all varieties of world Spanish, providing rich dialectal diversity in the Spanish language.
This language data contains a carefully curated and comprehensive list of 450,000 Spanish words.
Use Cases:
We consistently work with our clients on new use cases as language technology continues to evolve. These include NLP applications, TTS, dictionary display tools, games, translation, word embedding, and word sense disambiguation (WSD).
If you have a specific use case in mind that isn't listed here, weโd be happy to explore it with you. Donโt hesitate to get in touch with us at Oxford.Languages@oup.com to start the conversation.
Pricing:
Oxford Languages offers flexible pricing based on use case and delivery format. Our datasets are licensed via term-based IP agreements and tiered pricing for API-delivered data. Whether youโre integrating into a product, training an LLM, or building custom NLP solutions, we tailor licensing to your specific needs.
Contact our team or email us at Oxford.Languages@oup.com to explore pricing options and discover how our language data can support your goals.
About the sample:
The samples offer a brief overview of one or two language datasets (monolingual or/and bilingual dictionary data). To help you explore the structure and features of our dataset, we provide a sample in CSV format for preview purposes only.
If you need the complete original sample or more details about any dataset, please contact us (Growth.OL@oup.com) to request access or further information
Facebook
TwitterAs of March 2025, Elon Musk had a net worth valued at 328.5 billion U.S. dollars, making him the richest man in the world. Amazon founder Jeff Bezos followed in second, with Marc Zuckerberg, the founder of Facebook, in third. The list is dominated by Americans, and Alice Walton and Francoise Bettencourt Meyers are the only women among the 20 richest people worldwide.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for GOLD RESERVES reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
Facebook
Twitterhttps://www.newyork-demographics.com/terms_and_conditionshttps://www.newyork-demographics.com/terms_and_conditions
A dataset listing the 20 richest counties in New York for 2024, including information on rank, county, population, average income, and median income.
Facebook
Twitterhttps://www.michigan-demographics.com/terms_and_conditionshttps://www.michigan-demographics.com/terms_and_conditions
A dataset listing the 20 richest counties in Michigan for 2024, including information on rank, county, population, average income, and median income.
Facebook
Twitterhttps://www.florida-demographics.com/terms_and_conditionshttps://www.florida-demographics.com/terms_and_conditions
A dataset listing Florida counties by population for 2024.
Facebook
Twitterhttps://www.georgia-demographics.com/terms_and_conditionshttps://www.georgia-demographics.com/terms_and_conditions
A dataset listing Georgia counties by population for 2024.
Facebook
Twitterhttps://www.illinois-demographics.com/terms_and_conditionshttps://www.illinois-demographics.com/terms_and_conditions
A dataset listing the 20 richest counties in Illinois for 2024, including information on rank, county, population, average income, and median income.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the population of Rich Square by race. It includes the population of Rich Square across racial categories (excluding ethnicity) as identified by the Census Bureau. The dataset can be utilized to understand the population distribution of Rich Square across relevant racial categories.
Key observations
The percent distribution of Rich Square population by race (across all racial categories recognized by the U.S. Census Bureau): 39.49% are white, 53.21% are Black or African American, 1.54% are Asian, 1.67% are some other race and 4.10% are multiracial.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.
Racial categories include:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Rich Square Population by Race & Ethnicity. You can refer the same here
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
1. ID: - Description: A unique identifier for each individual. - Example Values: 123, 456, 789, etc.
2. Name: - Description: The name of the individual. - Example Values: John Doe, Jane Smith, Michael Johnson, etc.
3.Sex: - Description: The gender of the individual. - Example Values: Male, Female.
4.Age: - Description: The age of the individual. - Example Values: 25, 30, 22, etc.
5.Height: - Description: The height of the individual. - Example Values: 175 cm, 160 cm, 185 cm, etc.
6.Weight: - Description: The weight of the individual. - Example Values: 70 kg, 55 kg, 80 kg, etc.
7.Team: - Description: The sports team the individual is associated with. - Example Values: Team USA, Team Canada, Team Germany, etc.
8.NOC: - Description: The National Olympic Committee code representing the country. - Example Values: USA, CAN, GER, etc.
9.Games: - Description: The specific Olympic Games event. - Example Values: Summer Olympics, Winter Olympics.
10.Year: - Description: The year of the Olympic Games. - Example Values: 2000, 2012, 2016, etc.
11.Season: - Description: The season of the Olympic Games. - Example Values: Summer, Winter.
12.City: - Description: The city where the Olympic Games were held. - Example Values: Sydney, Beijing, Tokyo, etc.
13.Sport: - Description: The sport in which the individual participated. - Example Values: Swimming, Athletics, Gymnastics, etc.
14.Event: - Description: The specific event within the sport. - Example Values: 100m Freestyle, Long Jump, Parallel Bars, etc.
15.Medal: - Description: The type of medal won by the individual in the event. - Example Values: Gold, Silver, Bronze, or blank if no medal was won.
1.NOC: - Description: The National Olympic Committee code representing the country. - Example Values: USA, CAN, GER, etc.
2.Region: - Description: The geographical region associated with the National Olympic Committee. - Example Values: North America, Europe, Asia, etc.
3.Notes: - Description: Additional notes or information related to the National Olympic Committee or region. - Example Values: Historical context, special considerations, etc.
This table seems to provide information about the National Olympic Committees, their associated regions, and any additional notes that might be relevant. The "NOC" column serves as a key to potentially link information between this table and the previous one you mentioned.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Cleaned dataset from the Billionaires Statistic Dataset (2023) that can be found here.
The code I used to clean and re-structure the data is also here.
First things first: a big shout-out to Nidula Elgiriyewithana for providing the original data.
As with it, this dataset contains various information about the world's wealthiest persons in different columns that can be grouped into three different types:
If you want a challenge, you can create a dashboard using tools such as Plotly to dynamically visualize the data using one or different attributes (such as industry, age or country). I did it, leave the link below in case you want to investigate:
If you find this dataset informative or inspirational, a vote is appreciated for others to easily discover value in it ๐๐ฐ