Wikipedia
Source: https://huggingface.co/datasets/wikipedia Num examples: 1,281,412 Language: Vietnamese
from datasets import load_dataset
load_dataset("tdtunlp/wikipedia_vi")
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Vietnam Japan is a dataset for object detection tasks - it contains J And P annotations for 627 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Binhvq News
Source: https://github.com/binhvq/news-corpus Num examples: 19,365,593 Language: Vietnamese
from datasets import load_dataset
load_dataset("tdtunlp/binhvq_news_vi")
https://www.icpsr.umich.edu/web/ICPSR/studies/31101/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/31101/terms
The 1991 Vietnam Life History Survey is a cross-sectional study conducted to examine households and individuals in Vietnam. A 2-part survey was conducted, the first part focused on the respondents' household as the unit of analysis, information was collected for up to 15 respondents, although most households had only 4 to 6 respondents. The second part of the survey focused on individuals, the respondent's position in the household and their personal background. In the Individual dataset, observations were collected for up to 15 of the respondent's siblings. The 2 parts examined 4 samples of about 100 households, each stratified by region and urban/rural status in Vietnam with the household survey containing 403 household responses and the individual survey containing 921 respondents. Demographic variables in the Household dataset include region, household configuration, socioeconomic status, gender, ethnicity, appliance ownership, and house construction. Demographic variables in the Individual dataset include information on parents and siblings, familial occupations, ethnicity, sex, education, job history, marital status, and children information.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The total population in Vietnam was estimated at 101.3 million people in 2024, according to the latest census figures and projections from Trading Economics. This dataset provides - Vietnam Population - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Emotion recognition is a higher approach or special case of sentiment analysis. In this task, the result is not produced in terms of either polarity: positive or negative or in the form of rating (from 1 to 5) but of a more detailed level of sentiment analysis in which the result are depicted in more expressions like sadness, enjoyment, anger, disgust, fear and surprise. Emotion recognition plays a critical role in measuring brand value of a product by recognizing specific emotions of customers’ comments. In this study, we have achieved two targets. First and foremost, we built a standard Vietnamese Social Media Emotion Corpus (UIT-VSMEC) with about 6,927 human-annotated sentences with six emotion labels, contributing to emotion recognition research in Vietnamese which is a low-resource language in Natural Language Processing (NLP). Secondly, we assessed and measured machine learning and deep neural network models on our UIT-VSMEC. As a result, Convolutional Neural Network (CNN) model achieved the highest performance with 57.61% of F1-score.
Paper: Vong Ho, Duong Nguyen, Danh Nguyen, Linh Pham, Kiet Nguyen and Ngan Nguyen, Emotion Recognition for Vietnamese Social Media Text, 2019 16th International Conference of the Pacific Association for Computational Linguistics (PACLING 2019), October 11-13, 2019, Ha Noi, Vietnam. Link.
https://sites.google.com/uit.edu.vn/uit-nlp/datasets-projects
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
A geospatial dataset containing polylines of transportation network in Vietnam. It contains the railways, the principal roads and the secondary roads.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Bud500: A Comprehensive Vietnamese ASR Dataset
Introducing Bud500, a diverse Vietnamese speech corpus designed to support ASR research community. With aprroximately 500 hours of audio, it covers a broad spectrum of topics including podcast, travel, book, food, and so on, while spanning accents from Vietnam's North, South, and Central regions. Derived from free public audio resources, this publicly accessible dataset is designed to significantly enhance the work of developers and… See the full description on the dataset page: https://huggingface.co/datasets/linhtran92/viet_bud500.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The Vietnam Air Quality Data Series 2020 provides air quality values for several cities in Vietnam. Readers can access online data or historical data saved as of January 21, 2021 by zone GMT + 7
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about cities in Vietnam. It has 65 rows. It features 7 columns including country, population, latitude, and longitude.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about athletes in Vietnam. It has 15 rows. It features 8 columns including birth date, country, gender, and sport.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The research questionnaire was designed by adaptation measures from previous researchs for Vietnamese context. We conducted the data collection by using Google docs. We upload soft electronic copies of survey questionnaire online. The questionnaires were sent to about 1902 email addresses, which were collected from student alumni of 5 universities in Hanoi –the capital of Vietnam. We received 510 responses (response rate of 26.8%). After screening the questionnaires, bias answers were eliminated. The final sample size consists of 502 responses.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset is polygon data with geospatial referecing. The World Database on Protected Areas (WDPA) is the most comprehensive global spatial dataset on marine and terrestrial protected areas available. Protected areas are internationally recognised as major tools in conserving species and ecosystems. Up to date information on protected areas is essential to enable a wide range of conservation and development activities.
This dataset provides data on the occurrence and impacts of mass disasters in Vietnam from 1900 to 2024. This includes both natural (biological, climatological, extra-terrestrial, geophysical, hydrological, meteorological), and technological (industrial accident) disasters. Data was extracted from The International Disaster Database, Centre for Research on the Epidemiology of Disasters.
Traffic Flow Data In Ho Chi Minh City, Viet Nam
This dataset falls under the category Traffic Generating Parameters.
It contains the following data: Traffic flow
This dataset was scouted on 2022-02-10 as part of a data sourcing project conducted by TUMI. License information might be outdated: Check original source for current licensing.
The data can be accessed using the following URL / API Endpoint: https://www.kaggle.com/thanhnguyen2612/traffic-flow-data-in-ho-chi-minh-city-viet-nam
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product (GDP) in Vietnam was worth 476.39 billion US dollars in 2024, according to official data from the World Bank. The GDP value of Vietnam represents 0.45 percent of the world economy. This dataset provides the latest reported value for - Vietnam GDP - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Dataset of soil types of Vietnam is a geospatial polygon data which is based on FAO classification.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Exports in Vietnam decreased to 31.11 USD Billion in February from 33.09 USD Billion in January of 2025. This dataset provides the latest reported value for - Vietnam Exports - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The USD/VND exchange rate was unchanged at 26,345.0000 on September 1, 2025. Over the past month, the Vietnamese Dong has weakened 0.48%, and is down by 5.97% over the last 12 months. Vietnamese Dong - values, historical data, forecasts and news - updated on September of 2025.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Vietnam email data to connect with prominent professionals, increasing your sales and growing your market presence. Tap into the growing Vietnamese market with our comprehensive Vietnam email data. This premium list provides access to engaged consumers and businesses. As a result, you can broaden your customer base and increase sales. Moreover, our data is carefully compiled and validated. Therefore, you can ensure high deliverability and engagement. Consequently, you can tailor your marketing messages for maximum impact. Furthermore, this valuable resource enables you to build lasting relationships. Finally, List to Data offers this targeted dataset to help you succeed in Vietnam. Vietnam consumer email list empowers you to build valuable relationships with potential customers, fostering brand loyalty and driving repeat business. Access the Vietnamese market with our premium Vietnam consumer email list. This comprehensive resource provides access to a vast network of potential customers. As a result, you can increase your brand visibility and drive sales. Moreover, our data is regularly updated and verified. Therefore, you can improve your marketing ROI. Consequently, you can target specific demographics and regions. Furthermore, this valuable resource allows you to connect with key decision-makers. Finally, List to Data offers this powerful dataset to fuel your business growth in Vietnam. Vietnam business email list is a powerful resource for reaching professionals in Vietnam. This database provides verified leads to ensure your campaigns are effective. Additionally, it is designed to save time and maximize ROI. Moreover, the directory is regularly updated for accuracy. Furthermore, it offers a seamless way to expand your market reach. As a result, you can enhance your marketing efforts with reliable information. In addition, this library of contacts is tailored for both B2B and B2C outreach. Finally, trust List To Data to deliver a dataset that drives results and boosts your market presence.
Wikipedia
Source: https://huggingface.co/datasets/wikipedia Num examples: 1,281,412 Language: Vietnamese
from datasets import load_dataset
load_dataset("tdtunlp/wikipedia_vi")