Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual distribution of students across grade levels in August Martin High School
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This comprehensive synthetic dataset contains 2,514 authentic mobile app reviews spanning 40+ popular applications across 24 different languages, making it ideal for multilingual NLP, sentiment analysis, and cross-cultural user behavior research.
Column Name | Data Type | Description | Sample Values | Null Count |
---|---|---|---|---|
review_id | Integer | Unique identifier for each review | 1, 2, 3, ... | 0 |
user_id | String* | User identifier (should be integer) | "1967825", "9242600" | 0 |
app_name | String | Name of the mobile application | WhatsApp, Instagram, TikTok | 0 |
app_category | String | Application category | Social Networking, Entertainment | 0 |
review_text | String | Multilingual review content | "This app is amazing!" | 63 |
review_language | String | ISO language code | en, es, fr, zh, hi, ar | 0 |
rating | Mixed* | App rating (1.0-5.0, some as strings) | 4.5, "3.2", 1.1 | 38 |
review_date | DateTime | Timestamp of review submission | 2024-10-09 19:26:40 | 0 |
verified_purchase | Boolean | Purchase verification status | True, False | 0 |
device_type | String | Device platform | Android, iOS, iPad, Windows Phone | 0 |
num_helpful_votes | Mixed* | Helpfulness votes (some as strings) | 65, "209", 163 | 0 |
user_age | Float* | User age (should be integer) | 14.0, 18.0, 67.0 | 0 |
user_country | String | User's country | China, Germany, Nigeria | 50 |
user_gender | String | User gender | Male, Female, Non-binary, Prefer not to say | 88 |
app_version | String | Application version number | 1.4, v8.9, 2.8.37.5926 | 25 |
Note: Data types marked with an asterisk require cleaning/conversion
The dataset includes reviews in 24 languages:
- European: English (en), Spanish (es), French (fr), German (de), Italian (it), Russian (ru), Polish (pl), Dutch (nl), Swedish (sv), Danish (da), Norwegian (no), Finnish (fi)
- Asian: Chinese (zh), Hindi (hi), Japanese (ja), Korean (ko), Thai (th), Vietnamese (vi), Indonesian (id), Malay (ms)
- Other: Arabic (ar), Turkish (tr), Filipino (tl)
Reviews cover 18 distinct categories:
- Social Networking
- Entertainment
- Productivity
- Travel & Local
- Music & Audio
- Video Players & Editors
- Shopping
- Navigation
- Finance
- Communication
- Education
- Photography
- Dating
- Business
- Utilities
- Health & Fitness
- Games
- News & Magazines
40+ applications including:
- Social: WhatsApp, Instagram, Facebook, Snapchat, TikTok, LinkedIn, Twitter, Reddit, Pinterest
- Entertainment: YouTube, Netflix, Spotify
- Productivity: Microsoft Office, Google Drive, Dropbox, OneDrive, Zoom, Discord
- Travel: Uber, Lyft, Airbnb, Booking.com, Google Maps, Waze
- Finance: PayPal, Venmo
- Education: Duolingo, Khan Academy, Coursera, Udemy
- Tools: Grammarly, Canva, Adobe Photoshop, VLC, MX Player
Reviews from 24 countries across all continents:
- Asia: China, India, Japan, South Korea, Thailand, Vietnam, Indonesia, Malaysia, Philippines, Pakistan, Bangladesh
- Europe: Germany, United Kingdom, France, Italy, Spain, Russia, Turkey, Poland
- Americas: United States, Canada, Brazil, Mexico
- Oceania: Australia
- Africa: Nigeria
Intentional data challenges for learning:
- Missing Values: Strategic nulls in review_text (63), rating (38), user_country (50), user_gender (88), app_version (25)
- Data Type Issues:
  - user_id stored as strings (should be integers)
  - user_age stored as floats (should be integers)
  - Some rating values stored as strings (should be floats)
  - Some num_helpful_votes values stored as strings (should be integers)
- Mixed Version Formats: "1.4", "v8.9", "2.8.37.5926", "14.1.60.318-beta"
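The type issues listed above can be addressed with a few small conversion helpers. The following is a minimal sketch using only the Python standard library; the helper names (`to_int`, `to_float`, `parse_version`, `clean_row`) and the extra `app_version_parts` key are hypothetical and not part of the dataset, and the sketch assumes each review is available as a dictionary keyed by the column names in the table.

```python
import math
import re
from typing import Any, Optional, Tuple


def _is_missing(value: Any) -> bool:
    """Treat None, blank strings, and NaN as missing values."""
    if value is None:
        return True
    if isinstance(value, str) and not value.strip():
        return True
    return isinstance(value, float) and math.isnan(value)


def to_float(value: Any) -> Optional[float]:
    """Convert a rating such as 4.5 or "3.2" to float; missing stays None."""
    return None if _is_missing(value) else float(value)


def to_int(value: Any) -> Optional[int]:
    """Convert values such as "209", 65, or 14.0 to int; missing stays None."""
    return None if _is_missing(value) else int(float(value))


def parse_version(value: Any) -> Optional[Tuple[int, ...]]:
    """Extract the numeric parts of a version string.

    An optional leading "v" and any trailing suffix such as "-beta"
    are ignored (an assumption made for this sketch).
    """
    if _is_missing(value):
        return None
    match = re.match(r"v?(\d+(?:\.\d+)*)", str(value).strip())
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


def clean_row(row: dict) -> dict:
    """Return a copy of one review row with the flagged columns converted."""
    cleaned = dict(row)
    cleaned["user_id"] = to_int(row.get("user_id"))
    cleaned["user_age"] = to_int(row.get("user_age"))
    cleaned["rating"] = to_float(row.get("rating"))
    cleaned["num_helpful_votes"] = to_int(row.get("num_helpful_votes"))
    # Hypothetical helper column: comparable tuple form of app_version.
    cleaned["app_version_parts"] = parse_version(row.get("app_version"))
    return cleaned


sample = {
    "user_id": "1967825",
    "user_age": 14.0,
    "rating": "3.2",
    "num_helpful_votes": "209",
    "app_version": "v8.9",
}
print(clean_row(sample))
```

Missing values are kept as `None` rather than imputed, so the choice of how to handle the nulls stays with the analysis that follows.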
This dataset is perfect for:
- Multilingual NLP projects and sentiment analysis
- Cross-cultural user behavior analysis
- App store analytics and rating prediction
- Data cleaning and preprocessing practice
- Text classification across multiple languages
- Time series analysis of app reviews
- Geographic sentiment analysis
- Data engineering pipeline development
Richard M. Nixon was elected President of the United States on November 5, 1968, beating the Democratic Party candidate Hubert Humphrey with 43.4 percent of the vote. Nixon's presidency was notable for his focus on withdrawing U.S. troops from Vietnam, restoring diplomatic relations with China, the first oil crisis of 1973, and, ultimately, the Watergate scandal. Nixon's approval ratings during his first term were strong, leading to his re-election in November 1972, when he received 60.7 percent of the national vote, easily defeating the Democratic nominee George McGovern.
Watergate and Nixon's resignation
Public opinion rapidly shifted against Nixon after the election, as news broke of the illegal activities his staff orchestrated during the 1972 election campaign, when they wiretapped phones in the Democratic National Committee's headquarters at the Watergate Complex. The Nixon administration spent the following year trying to cover up evidence of its involvement in the break-in, but the President committed impeachable offenses in the process. With rapidly declining popularity among the public, as well as the opening of an impeachment process in October 1973, Richard Nixon resigned in August 1974. This makes him the only president in U.S. history to resign from office.
https://creativecommons.org/publicdomain/zero/1.0/
The International Chess Federation (FIDE) governs international chess competition. FIDE uses the Elo rating system to calculate the relative skill levels of players.
The dataset contains details of all the chess players in the world sorted by their Standard FIDE rating (highest to lowest) as updated by FIDE in August 2020. The data includes all active and inactive players which can be identified by the Inactive_flag column.
Note: All ratings are updated as published by FIDE in August 2020.
FIDE: https://www.fide.com/
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual distribution of students across grade levels in August Ahrens Elementary School
In August 2022, the most popular evening television news program in Thailand was Thai Rath News Show on Thai Rath 32 channel with a rating score of approximately ***. This was followed by Toob Thoh Khao by Amarin TV which received a rating score of around ***.
A ranking of the Office of Hearings Operations (OHO) hearing offices by the average number of days until final disposition of the hearing request. The average shown will be a combined average for all cases completed in that hearing office. Report for August 2024.
https://creativecommons.org/publicdomain/zero/1.0/
The International Chess Federation (FIDE) governs international chess competition. FIDE uses the Elo rating system to calculate the relative skill levels of players.
The dataset contains details of the top women chess players in the world sorted by their Standard FIDE rating (highest to lowest, above 1800 Elo) as updated in August 2020. The data includes all active and inactive players, which can be identified by the Inactive_flag column.
Note: All ratings are updated as published by FIDE in August 2020.
FIDE: https://www.fide.com/
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual overall school rank from 2012 to 2023 for August Martin High School
A ranking of Office of Hearings Operations (OHO) hearing offices by the average number of hearings dispositions per administrative law judge (ALJ) per day. The average shown will be a combined average for all ALJs working in that hearing office. Report for August 2015.
According to a survey conducted in July 2025, around 49 percent of Americans had a very unfavorable view of Donald Trump, while 24 percent of Americans held a very favorable view. Donald Trump was elected President of the United States in November 2024. The former president will be sworn in for a second term on January 20, 2025.
Shifting perceptions of trustworthiness
Despite the significant portion of Americans who view Trump unfavorably, his perceived trustworthiness has shown improvement over time. A September 2024 survey found that 41 percent of registered voters considered Trump honest and trustworthy, marking an increase from 38 percent in 2016.
Policy proposals and partisan support
Trump's policy proposals have continued to garner strong support from his Republican base while facing opposition from Democrats. An August 2024 survey showed roughly 85 percent of Republicans backing Trump's plan to arrest and deport thousands of illegal immigrants, compared to only 22 percent of Democrats. This stark partisan divide on key policy issues reflects the broader polarization in Trump's favorability ratings.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This table presents an overview of the number of guests and their overnight stays in the Netherlands in all hotels, motels, boarding houses, apartments with hotel services, youth accommodation and bed & breakfasts with at least 5 sleeping places. The figures can be broken down by country of residence of the guest, and by star rating. Figures are available for The Netherlands as a whole, and for the city of Amsterdam.
The breakdown by star rating is based on the opinion of the accommodation itself. The star rating does not have to be officially registered. The breakdown contains all types of accommodation mentioned above, not just hotels. The '5 stars' category contains 5 star hotels, but also for instance 5 star bed&breakfasts.
Break in series: Previously published figures on guests and overnight stays per star rating for the years until 2015 were based on official registrations of the number of stars by the 'Bedrijfschap Horeca en Catering'. This official registration no longer exists. Therefore, Statistics Netherlands started asking accommodations about their number of stars in its annual survey. For this reason, the figures in this table are not directly comparable with figures published about the years until 2015.
Data available from: 2017
Status of the figures: The figures for 2023 are revised provisional, figures for 2024 are provisional and all other figures are final.
Changes as of 11 July 2024: The provisional figures for May 2024 have been added.
Changes as of 11 December 2023: The provisional figures for September and October 2023 have been added. Despite the care with which the figures and previous publications have been compiled about all overnight accommodation establishments in the Netherlands, it has been noticed that the published figures for the reporting periods April 2022 to August 2023 are incorrect. Statistics Netherlands has published improved figures for the statistics on all overnight accommodation establishments in the Netherlands for the reporting periods April 2022 to August 2023. Also the improved figures of the associated quarters and the figure for 2022 are published.
Changes as of 14 November 2023: Despite the care with which the figures and previous publications have been compiled about all overnight accommodation establishments in the Netherlands, it has been noticed that the published figures for the reporting periods after March 2022 are incorrect due to a technical error. Statistics Netherlands is investigating the influence of this technical error on the results of this statistic and other statistics that use this statistic.
Based on this research, Statistics Netherlands will publish improved figures for the statistics on all overnight accommodation establishments in the Netherlands on December 11. This concerns the months April 2022 to October 2023 and the associated quarters. This means that the publication already planned for November 14 (covering the reporting month of September) will be postponed for a few weeks.
When will new figures be published? Figures for a new month become available within three months after the end of that month; these are provisional figures. The figures for the complete year are revised one month after publication of the December figures; these are revised provisional figures. Definite figures are published two months later.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual distribution of students across grade levels in August Boeger Middle School
In October 2020, the overall Import Price Index stood at 123.2. This was a slight decrease from the previous month. The Index consists of all fuel and nonfuel imports and has a base period of 2000=100. The data are not seasonally adjusted.
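Because the base period is 2000=100, an index value can be read directly as a percent change from the base-period price level. The short sketch below shows the arithmetic; the function name `percent_change_from_base` is hypothetical and used only for illustration.

```python
def percent_change_from_base(index_value: float, base: float = 100.0) -> float:
    """Percent change of an index value relative to its base-period level."""
    return (index_value - base) / base * 100.0


# October 2020 overall Import Price Index, base period 2000=100.
change = percent_change_from_base(123.2)
print(f"{change:.1f} percent above the base-period level")
```

Under this reading, a value of 123.2 corresponds to import prices roughly 23.2 percent above their 2000 level.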
In August 2025, approximately 59 percent of people in Great Britain had a positive opinion of King Charles III, compared with 31 percent who had a negative opinion. Just before his coronation on May 6, 2023, 59 percent of people in Great Britain had a positive opinion, with 33 percent who had a negative opinion. Between October 2019 and April 2023, King Charles was viewed most positively in September 2022, when 70 percent of Britons had a positive opinion of him, and viewed most negatively in March 2021, when 42 percent of respondents had a negative view of him.
As of June 2025, approximately ** percent of respondents in Japan approved of the cabinet, while ** percent disapproved. The new cabinet led by Shigeru Ishiba was inaugurated in October 2024. His predecessor, Fumio Kishida, announced his resignation in August 2024 due to declining approval ratings. During the measured period, the approval rate was highest when the previous cabinet of Yoshihide Suga was inaugurated in September 2020.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This table presents an overview of the capacity (type of accommodation, rooms, beds) in the Netherlands in all hotels, motels, boarding houses, apartments with hotel services, youth accommodation and bed & breakfasts with at least 5 sleeping places. The figures can be broken down by star rating. Figures are available for The Netherlands as a whole, and for the city of Amsterdam.
The breakdown by star rating is based on the opinion of the accommodation itself. The star rating does not have to be officially registered. The breakdown contains all types of accommodation mentioned above, not just hotels. The '5 stars' category contains 5 star hotels, but also for instance 5 star bed&breakfasts.
Break in series: Previously published figures on guests and overnight stays per star rating for the years until 2015 were based on official registrations of the number of stars by the 'Bedrijfschap Horeca en Catering'. This official registration no longer exists. Therefore, Statistics Netherlands started asking accommodations about their number of stars in its annual survey. For this reason, the figures in this table are not directly comparable with figures published about the years until 2015.
Data available from: 2017
Status of the figures: The figures for 2023 are revised provisional, figures for 2024 are provisional and all other figures are final.
Changes as of 11 July 2024: The provisional figures for May 2024 have been added.
Changes as of 11 December 2023: The provisional figures for September and October 2023 have been added. Despite the care with which the figures and previous publications have been compiled about all overnight accommodation establishments in the Netherlands, it has been noticed that the published figures for the reporting periods April 2022 to August 2023 are incorrect. Statistics Netherlands has published improved figures for the statistics on all overnight accommodation establishments in the Netherlands for the reporting periods April 2022 to August 2023. Also the improved figures of the associated quarters and the figure for 2022 are published.
Changes as of 14 November 2023: Despite the care with which the figures and previous publications have been compiled about all overnight accommodation establishments in the Netherlands, it has been noticed that the published figures for the reporting periods after March 2022 are incorrect due to a technical error. Statistics Netherlands is investigating the influence of this technical error on the results of this statistic and other statistics that use this statistic.
Based on this research, Statistics Netherlands will publish improved figures for the statistics on all overnight accommodation establishments in the Netherlands on December 11. This concerns the months April 2022 to October 2023 and the associated quarters. This means that the publication already planned for November 14 (covering the reporting month of September) will be postponed for a few weeks.
When will new figures be published? Figures for a new month become available within three months after the end of that month; these are provisional figures. The figures for the complete year are revised one month after publication of the December figures; these are revised provisional figures. Definite figures are published two months later.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual distribution of students across grade levels in August Schilling Elementary School
This statistic shows the best-rated hotels in the United States as of October 2016. According to Condé Nast Traveler readers, the best hotel in the United States was the Virgin Hotels Chicago, Chicago, Illinois for which they gave a score of 98.81.
The data was scraped from the Playstore website using a Selenium script. It is in raw format, with the full review content from each user. After manual labelling, the data can be used for classification and clustering into multiple label sets (UX, Bug, Other, ...). The reviews generally convey the sentiment of the user, but they also hold other information about the application that is essential for app developers. The goal of this data is to identify the nature of each review and to sort out the reviews that can be put to productive use.
This data was a trial run for the script, and a check on whether the application (IKEA) has enough reviews that can be categorized to perform classification and build a generalized model.
Time Period: April 2018 - Aug 2019
This work was done along with @pragya1798
Application on Playstore - IKEA Playstore Application