Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
wikipedia Korean 2024.5.1 cut
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product (GDP) in South Korea was worth 1712.79 billion US dollars in 2023, according to official data from the World Bank. The GDP value of South Korea represents 1.62 percent of the world economy. This dataset provides - South Korea GDP - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The benchmark interest rate in South Korea was last recorded at 2.50 percent. This dataset provides - South Korea Interest Rate - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product per capita in South Korea was last recorded at 34121.02 US dollars in 2023. The GDP per Capita in South Korea is equivalent to 270 percent of the world's average. This dataset provides - South Korea GDP per capita - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The USD/KRW exchange rate rose to 1,385.6400 on September 9, 2025, up 0.03% from the previous session. Over the past month, the South Korean Won has strengthened 0.38%, but it's down by 3.20% over the last 12 months. South Korean Won - values, historical data, forecasts and news - updated on September of 2025.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The ethnobotany is important in two aspects. The first one concerns the fact that the ethnobotanical plants have been with us for a long time and they have been used for many purposes including medicine. The second one concerns that their effectiveness in many fields has been proved and we can extract useful materials from them. The environmental problems due to a rapid economical development made their population to be decreased continually. So we need to have a systematic plan to conserve them. This research aims at collecting and organizing the data on Korean ethnobotany. Additionally we will input the data to a computer database and construct a web system to make it to be shared by domestic and international researchers.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product per capita in South Korea was last recorded at 49995.49 US dollars in 2023, when adjusted by purchasing power parity (PPP). The GDP per Capita, in South Korea, when adjusted by Purchasing Power Parity is equivalent to 281 percent of the world's average. This dataset provides the latest reported value for - South Korea GDP per capita PPP - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
KDPII: A New Korean Dialogic Dataset for the Deidentification of Personally Identifiable Information
The rapid growth of social media in the era of big data and artificial intelligence has raised significant safety concerns related to the communication of sensitive personal information. In modern society, awareness of the importance of preserving privacy is growing, so there is a rising advocacy for adopting language modeling technology to mitigate the risk of personal information leakage and to deidentify sensitive information depending on the situation. Thus far, several theoretical analyses of privacy protection in Korea have been conducted. However, the technical development of language model training resources for Korean has been slower than those of widely spoken languages such as English and Chinese. To address this problem, we developed a comprehensive and organized framework for classifying Korean personally identifiable information (PII) by investigating pertinent examples, such as โText Anonymization Benchmarkโ and โNetwork Intrusion Detection Dataset,โ from within and outside Korea. Subsequently, we created a new Korean dataset for PII deidentification, KDPII, which consists of many conversational texts incorporating plentiful Korean PII. Based on this, we examined the Korean PII processing performances of many representative language models that are available on the market. Finally, we found that although the performance of language models in identifying PII varied by model size, model architecture, and training source, most of them were significantly better at recognizing universal PII than language-specific PII, which indicates a prospective direction of expanding training data for implementing Korean-specific PII deidentification in the future.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Although the risk for depression appears to be related to daily dietary habits, how the proportion of major macronutrients affects the occurrence of depression remains largely unknown. This study aims to estimate the association between macronutrients (i.e., carbohydrate, protein, fat) and depression through national survey datasets from the United States and South Korea. Association between the prevalence of depression and each macronutrient was measured from 60,935 participants from the National Health and Nutrition Examination Survey (NHANES) and 15,700 participants from the South Korea NHANES (K-NHANES) databases. When the proportion of calories intake by protein increased by 10%, the prevalence of depression was significantly reduced both in the United States [Odds Ratio, OR (95% CI), 0.621 (0.530โ0.728)] and South Korea [0.703 (0.397โ0.994)]. An association between carbohydrate intake and the prevalence of depression was seen in the United States [1.194 (1.116โ1.277)], but not in South Korea. Fat intake was not significantly associated with depression in either country. Subsequent analysis showed that the low protein intake groups had significantly higher risk for depression than the normal protein intake groups in both the United States [1.648 (1.179โ2.304)] and South Korea [3.169 (1.598โ6.286)]. In the daily diet of macronutrients, the proportion of protein intake is significantly associated with the prevalence of depression. These associations were more prominent in adults with insufficient protein intake, and the pattern of association between macronutrients and depression in Asian American and South Korean populations were similar. Our findings suggest that the proportion of macronutrients intake in everyday life may be related to the occurrence of depression.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
South Korea recorded a trade surplus of 6610 USD Million in July of 2025. This dataset provides the latest reported value for - South Korea Balance of Trade - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
This data package includes the underlying data files to replicate the data, tables, and charts presented in How the United States solved South Koreaโs problems with electric vehicle subsidies under the Inflation Reduction Act, PIIE Working Paper 23-6.
If you use the data, please cite as: Bown, Chad P. 2023. How the United States solved South Koreaโs problems with electric vehicle subsidies under the Inflation Reduction Act. PIIE Working Paper 23-6. Washington, DC: Peterson Institute for International Economics.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Money Supply M2 in South Korea increased to 4314652.60 KRW Billion in June from 4278815.60 KRW Billion in May of 2025. This dataset provides - South Korea Money Supply M2 - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Pairs of Korean speakers reading a script with , with , are recorded. The recordings took place in , which are an anechoic chamber, studio apartment, and dance studio, of which the level of reverberation differs. And in order to examine the effect of the distance of mic from the source and device, every experiment is recorded at with , iPhone X, and Galaxy S7.
There were two individuals in a group, and each group was distinguished by a unique key(subject ID). Two speakers(speaker a, speaker b) were facing each other 1.4m apart from each other (0.7m from the middle line). They read the allocated scripts alternating between speaker a and b, as if they were talking to each other.
Recording environment | Studio Apartment(moderate reverb), Dance studio(high reverb), Anechoic Chamber(no reverb) |
---|---|
Device | iPhone X(iOS), Samsung Galaxy S7 |
Recording distance from the source | 0.4m, 2.0m, 4.0m |
Volume(Sample) | ~ 290(~ 3) hours, ~ 190,000(~ 2,000) utterances, ~ 107(~ 0.5) GB |
Format | wav/h5(16/44.1kHz, 16-bit, mono) |
Language | Korean |
Studio Apartment | Dance studio | Anechoic Chamber |
---|---|---|
Refer to the dataset descriptions in 'docs' for detailed description and statistics of the full set of the dataset.
The dataset is a subset(approximately 1%) of a much bigger dataset which were recorded under the same circumstances as these open source datasets. Please contact us(contact@deeplyinc.com) for the full set with the commercial license.
The illustrations below are the statistics about the Deeply Korean Read Speech Corpus. The first three are from the sample dataset, And the others are from the full dataset. To attain more insight about the dataset, please refer to the detailed description in 'docs' and 'Korea_Read_Speech_Corpus.json' in 'Dataset'.
The sample is a partial set of recordings from a single subject group(sub1001), which consists of 20-year-old female(speaker a) and 49-year-old female(speaker b).
https://github.com/deeplyinc/Korean-Read-Speech-Corpus/blob/main/etc/fig0.png?raw=true">
https://github.com/deeplyinc/Korean-Read-Speech-Corpus/blob/main/etc/fig1.png?raw=true">
https://github.com/deeplyinc/Korean-Read-Speech-Corpus/blob/main/etc/fig2.png?raw=true">
https://github.com/deeplyinc/Korean-Read-Speech-Corpus/blob/main/etc/fig3.png?raw=true">
https://github.com/deeplyinc/Korean-Read-Speech-Corpus/blob/main/etc/fig4.png?raw=true">
https://github.com/deeplyinc/Korean-Read-Speech-Corpus/blob/main/etc/fig5.png?raw=true">
https://github.com/deeplyinc/Korean-Read-Speech-Corpus/blob/main/etc/fig6.png?raw=true">
โโโ Dataset
โ โโโ AirbnbStudio
โ โ โโโ sub100100a00000.wav
โ โ โโโ ...
โ โโโ AnechoicChamber
โ โ โโโ sub100120a00000.wav
โ โ โโโ ...
โ โโโ DanceStudio
โ โ โโโ sub100110a00000.wav
โ โ โโโ ...
โ โโโ Korean_Read_Speech_Corpus.json
โโโ docs
โโโ Deeply Korean Read Speech Corpus_Eng.pdf
โโโ Deeply Korean Read Speech Corpus_Kor.pdf
Korean_Read_Speech_Corpus.json
{'AirbnbStudio':
{'sub100100a00000': {'text_sentiment': 0,
'voice_sentiment': -1,
'subjectID': 'sub1001',
'speaker': 'a',
'age': 20,
'sex': 0,
'noise': 0,
'location': 0,
'distance': 0,
'device': 0,
'text': '์ ์๋น ์์์ด ์ ๋ง ๋ง์๋ ๋ด์.',
'text_code': 'aa0',
'rms': 0.024304501712322235,
'length': 2.71825},
...
},
...
}
Text sentiment: {-1: negative, 0: neutral, 1: positive}
Vocie sentiment: {-1: negative, 0: neutral, 1: positive}
Subject ID: Unique 'sub + 4-digit' key allocated to each subject group
Speaker: unique key allocated to each indiivdual in the subject group.
Sex: {0: Female, 1: Male}
Noise: {0: Noiseless, 1: Indoor no...
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This dataset is provided by AIxBlock, an unified platform for AI development and AI workflows automation. This dataset contains around 700k sentences in Korean, making it a valuable resource for a wide range of language technology applications. All data has undergone quality assurance (QA) checks to ensure clarity, correctness, and natural phrasing. The dataset is well-suited for: Speech data generation (e.g., recording short audio clips lasting 8โ30 seconds per sentence) Natural Languageโฆ See the full description on the dataset page: https://huggingface.co/datasets/AIxBlock/Korean-short-utterances.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product (GDP) in North Korea was worth 18 billion US dollars in 2019, according to official data from the World Bank. The GDP value of North Korea represents 0.02 percent of the world economy. This dataset provides - North Korea GDP - actual values, historical data, forecast, chart, statistics, economic calendar and news.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for KoBBQ
The Bias Benchmark for Question Answering (BBQ) is designed to evaluate social biases of language models (LMs), but it is not simple to adapt this benchmark to cultural contexts other than the US because social biases depend heavily on the cultural context. In this paper, we present KoBBQ, a Korean bias benchmark dataset, and we propose a general framework that addresses considerations for cultural adaptation of a dataset. Our framework includesโฆ See the full description on the dataset page: https://huggingface.co/datasets/naver-ai/kobbq.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
wikipedia Korean 2024.5.1 cut