85 datasets found
  1. Spatiotemporal data on Chinese population distribution from 1949 to 2013

    • springernature.figshare.com
    zip
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lizhe Wang; Lajiao Chen (2023). Spatiotemporal data on Chinese population distribution from 1949 to 2013 [Dataset]. http://doi.org/10.6084/m9.figshare.c.3291368_D2.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Lizhe Wang; Lajiao Chen
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Population: This dataset contains 65-years’ time serial data of whole China (unit: million persons), each provinces (unit: 10000 persons), and each county. The source data are originally collected from China Statistical Yearbook from 1949 to 2013. The county data covers 2000, 2006, 2007, and 2009. In addition, 4 years (1995, 2000, 2005, 2010) population distributions cover the whole land region in China are also included in this dataset. Such data is expressed as raster format with 1 km resolution and a projection of Albers. Attribute information mainly includes population density (unit: number of person per square kilometer). The source data are originally provided by Data Center for Resources and Environmental Sciences, Chinese Academy of Sciences (RESDC) (http://www.resdc.cn) and Data Sharing Infrastructure of Earth System Science (http://www.geodata.cn).These data are not intended for demarcation.

  2. Population development of China 0-2100

    • statista.com
    Updated Aug 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Population development of China 0-2100 [Dataset]. https://www.statista.com/statistics/1304081/china-population-development-historical/
    Explore at:
    Dataset updated
    Aug 7, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    China
    Description

    The region of present-day China has historically been the most populous region in the world; however, its population development has fluctuated throughout history. In 2022, China was overtaken as the most populous country in the world, and current projections suggest its population is heading for a rapid decline in the coming decades. Transitions of power lead to mortality The source suggests that conflict, and the diseases brought with it, were the major obstacles to population growth throughout most of the Common Era, particularly during transitions of power between various dynasties and rulers. It estimates that the total population fell by approximately 30 million people during the 14th century due to the impact of Mongol invasions, which inflicted heavy losses on the northern population through conflict, enslavement, food instability, and the introduction of bubonic plague. Between 1850 and 1870, the total population fell once more, by more than 50 million people, through further conflict, famine and disease; the most notable of these was the Taiping Rebellion, although the Miao an Panthay Rebellions, and the Dungan Revolt, also had large death tolls. The third plague pandemic also originated in Yunnan in 1855, which killed approximately two million people in China. 20th and 21st centuries There were additional conflicts at the turn of the 20th century, which had significant geopolitical consequences for China, but did not result in the same high levels of mortality seen previously. It was not until the overlapping Chinese Civil War (1927-1949) and Second World War (1937-1945) where the death tolls reached approximately 10 and 20 million respectively. Additionally, as China attempted to industrialize during the Great Leap Forward (1958-1962), economic and agricultural mismanagement resulted in the deaths of tens of millions (possibly as many as 55 million) in less than four years, during the Great Chinese Famine. This mortality is not observable on the given dataset, due to the rapidity of China's demographic transition over the entire period; this saw improvements in healthcare, sanitation, and infrastructure result in sweeping changes across the population. The early 2020s marked some significant milestones in China's demographics, where it was overtaken by India as the world's most populous country, and its population also went into decline. Current projections suggest that China is heading for a "demographic disaster", as its rapidly aging population is placing significant burdens on China's economy, government, and society. In stark contrast to the restrictive "one-child policy" of the past, the government has introduced a series of pro-fertility incentives for couples to have larger families, although the impact of these policies are yet to materialize. If these current projections come true, then China's population may be around half its current size by the end of the century.

  3. Z

    Modern China Geospatial Database - Main Dataset

    • data.niaid.nih.gov
    • zenodo.org
    Updated Feb 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian Henriot (2025). Modern China Geospatial Database - Main Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5735393
    Explore at:
    Dataset updated
    Feb 28, 2025
    Dataset authored and provided by
    Christian Henriot
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    MCGD_Data_V2.2 contains all the data that we have collected on locations in modern China, plus a number of locations outside of China that we encounter frequently in historical sources on China. All further updates will appear under the name "MCGD_Data" with a time stamp (e.g., MCGD_Data2023-06-21)

    You can also have access to this dataset and all the datasets that the ENP-China makes available on GitLab: https://gitlab.com/enpchina/IndexesEnp

    Altogether there are 464,970 entries. The data include the name of locations and their variants in Chinese, pinyin, and any recorded transliteration; the name of the province in Chinese and in pinyin; Province ID; the latitude and longitude; the Name ID and Location ID, and NameID_Legacy. The Name IDs all start with H followed by seven digits. This is the internal ID system of MCGD (the NameID_Legacy column records the Name IDs in their original format depending on the source). Locations IDs that start with "DH" are data points extracted from China Historical GIS (Harvard University); those that start with "D" are locations extracted from the data points in Geonames; those that have only digits (8 digits) are data points we have added from various map sources.

    One of the main features of the MCGD Main Dataset is the systematic collection and compilation of place names from non-Chinese language historical sources. Locations were designated in transliteration systems that are hardly comprehensible today, which makes it very difficult to find the actual locations they correspond to. This dataset allows for the conversion from these obsolete transliterations to the current names and geocoordinates.

    From June 2021 onward, we have adopted a different file naming system to keep track of versions. From MCGD_Data_V1 we have moved to MCGD_Data_V2. In June 2022, we introduced time stamps, which result in the following naming convention: MCGD_Data_YYYY.MM.DD.

    UPDATES

    MCGD_Data2025_02_28 includes a major change with the duplication of all the locations listed under Beijing, Shanghai, Tianjin, and Chongqing (北京, 上海, 天津, 重慶) and their listing under the name of the provinces to which they belonge origially before the creation of the four special municipalities after 1949. This is meant to facilitate the matching of data from historical sources. Each location has a unique NameID. Altogether there are 472,818 entries

    MCGD_Data2025_02_27 inclues an update on locations extracted from Minguo zhengfu ge yuanhui keyuan yishang zhiyuanlu 國民政府各院部會科員以上職員錄 (Directory of staff members and above in the ministries and committees of the National Government). Nanjing: Guomin zhengfu wenguanchu yinzhuju 國民政府文官處印鑄局國民政府文官處印鑄局, 1944). We also made corrections in the Prov_Py and Prov_Zh columns as there were some misalignments between the pinyin name and the name in Chines characters. The file now includes 465,128 entries.

    MCGD_Data2024_03_23 includes an update on locations in Taiwan from the Asia Directories. Altogether there are 465,603 entries (of which 187 place names without geocoordinates, labelled in the Lat Long columns as "Unknown").

    MCGD_Data2023.12.22 contains all the data that we have collected on locations in China, whatever the period. Altogether there are 465,603 entries (of which 187 place names without geocoordinates, labelled in the Lat Long columns as "Unknown"). The dataset also includes locations outside of China for the purpose of matching such locations to the place names extracted from historical sources. For example, one may need to locate individuals born outside of China. Rather than maintaining two separate files, we made the decision to incorporate all the place names found in historical sources in the gazetteer. Such place names can easily be removed by selecting all the entries where the 'Province' data is missing.

  4. F

    Mandarin General Conversation Speech Dataset for ASR

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Mandarin General Conversation Speech Dataset for ASR [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/general-conversation-mandarin-china
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    Welcome to the Mandarin Chinese General Conversation Speech Dataset — a rich, linguistically diverse corpus purpose-built to accelerate the development of Mandarin speech technologies. This dataset is designed to train and fine-tune ASR systems, spoken language understanding models, and generative voice AI tailored to real-world Mandarin Chinese communication.

    Curated by FutureBeeAI, this 30 hours dataset offers unscripted, spontaneous two-speaker conversations across a wide array of real-life topics. It enables researchers, AI developers, and voice-first product teams to build robust, production-grade Mandarin speech models that understand and respond to authentic Chinese accents and dialects.

    Speech Data

    The dataset comprises 30 hours of high-quality audio, featuring natural, free-flowing dialogue between native speakers of Mandarin Chinese. These sessions range from informal daily talks to deeper, topic-specific discussions, ensuring variability and context richness for diverse use cases.

    Participant Diversity:
    Speakers: 60 verified native Mandarin Chinese speakers from FutureBeeAI’s contributor community.
    Regions: Representing various provinces of China to ensure dialectal diversity and demographic balance.
    Demographics: A balanced gender ratio (60% male, 40% female) with participant ages ranging from 18 to 70 years.
    Recording Details:
    Conversation Style: Unscripted, spontaneous peer-to-peer dialogues.
    Duration: Each conversation ranges from 15 to 60 minutes.
    Audio Format: Stereo WAV files, 16-bit depth, recorded at 16kHz sample rate.
    Environment: Quiet, echo-free settings with no background noise.

    Topic Diversity

    The dataset spans a wide variety of everyday and domain-relevant themes. This topic diversity ensures the resulting models are adaptable to broad speech contexts.

    Sample Topics Include:
    Family & Relationships
    Food & Recipes
    Education & Career
    Healthcare Discussions
    Social Issues
    Technology & Gadgets
    Travel & Local Culture
    Shopping & Marketplace Experiences, and many more.

    Transcription

    Each audio file is paired with a human-verified, verbatim transcription available in JSON format.

    Transcription Highlights:
    Speaker-segmented dialogues
    Time-coded utterances
    Non-speech elements (pauses, laughter, etc.)
    High transcription accuracy, achieved through double QA pass, average WER < 5%

    These transcriptions are production-ready, enabling seamless integration into ASR model pipelines or conversational AI workflows.

    Metadata

    The dataset comes with granular metadata for both speakers and recordings:

    Speaker Metadata: Age, gender, accent, dialect, state/province, and participant ID.
    Recording Metadata: Topic, duration, audio format, device type, and sample rate.

    Such metadata helps developers fine-tune model training and supports use-case-specific filtering or demographic analysis.

    Usage and Applications

    This dataset is a versatile resource for multiple Mandarin speech and language AI applications:

    ASR Development: Train accurate speech-to-text systems for Mandarin Chinese.
    Voice Assistants: Build smart assistants capable of understanding natural Chinese conversations.
    <div style="margin-top:10px; margin-bottom: 10px; padding-left: 30px; display: flex; gap: 16px;

  5. Metadata document for spatiotemporal dataset on Chinese population...

    • springernature.figshare.com
    docx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lizhe Wang; Lajiao Chen (2023). Metadata document for spatiotemporal dataset on Chinese population distribution and its driving factors from 1949 to 2013 [Dataset]. http://doi.org/10.6084/m9.figshare.c.3291368_D3.v1
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Lizhe Wang; Lajiao Chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Metadata document for datasets included in this data collection.

  6. T

    China Unemployment Rate

    • tradingeconomics.com
    • jp.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Jun 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). China Unemployment Rate [Dataset]. https://tradingeconomics.com/china/unemployment-rate
    Explore at:
    csv, xml, excel, jsonAvailable download formats
    Dataset updated
    Jun 16, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Sep 30, 2002 - May 31, 2025
    Area covered
    China
    Description

    Unemployment Rate in China decreased to 5 percent in May from 5.10 percent in April of 2025. This dataset provides - China Unemployment Rate - actual values, historical data, forecast, chart, statistics, economic calendar and news.

  7. T

    Kilometer grid dataset of China's historical population spatial distribution...

    • data.tpdc.ac.cn
    • tpdc.ac.cn
    zip
    Updated Sep 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Can WANG; Jiachen WANG (2022). Kilometer grid dataset of China's historical population spatial distribution (1990-2015) [Dataset]. http://doi.org/10.12078/2017121101
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 21, 2022
    Dataset provided by
    TPDC
    Authors
    Can WANG; Jiachen WANG
    Area covered
    Description

    Provide detailed spatial distribution of population data in China from 1990 to 2015 year by year. The data is 1km grid data, with population pop as the indicator. The grid data comprehensively considers multiple factors for weight distribution to realize the spatialization of population, which is convenient for data sharing and spatial statistical analysis. The data comes from the Resource and Environmental Science and Data Center of the Institute of Geographic Science and Resources, Chinese Academy of Sciences. The annual data is obtained by linear interpolation of the original data, and saved in geotiff file format. The methods and standards of data over the years are consistent, the coverage is complete, and the collection and processing process is traceable and reliable.

  8. M

    China Population (1950-2025)

    • macrotrends.net
    csv
    Updated May 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MACROTRENDS (2025). China Population (1950-2025) [Dataset]. https://www.macrotrends.net/global-metrics/countries/chn/china/population
    Explore at:
    csvAvailable download formats
    Dataset updated
    May 31, 2025
    Dataset authored and provided by
    MACROTRENDS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description
    Total current population for China in 2025 is 1,424,381,924, a 0.06% decline from 2024.
    <ul style='margin-top:20px;'>
    
    <li>Total population for China in 2024 was <strong>1,425,178,782</strong>, a <strong>1.03% increase</strong> from 2023.</li>
    <li>Total population for China in 2023 was <strong>1,410,710,000</strong>, a <strong>0.1% decline</strong> from 2022.</li>
    <li>Total population for China in 2022 was <strong>1,412,175,000</strong>, a <strong>0.01% decline</strong> from 2021.</li>
    </ul>Total population is based on the de facto definition of population, which counts all residents regardless of legal status or citizenship. The values shown are midyear estimates.
    
  9. u

    Reevaluating Political Trust and Social Desirability in China - Dataset -...

    • bsos-data.umd.edu
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reevaluating Political Trust and Social Desirability in China - Dataset - BSOS Data Repository [Dataset]. https://bsos-data.umd.edu/dataset/making-the-list-reevaluating-political-trust-and-social-desirability-in-china
    Explore at:
    Area covered
    China
    Description

    The data comes from the Harvard Dataverse and covers information regarding political trust & regime support in China and self-monitoring, which determines the participants' desire for social desirability. Authors Nicholson and Huang obtained the data via a standard survey experiment that contains an embedded list experiment. The list experiment aspect is significant because list experiments are an "indirect way to gauge overreporting" (Nicholson and Haung). The data have possibilities for helping understand Chinese politics, such as how support varies at different government levels and how overreporting is affected by a person's social desirability. This data can be used in government classes and coding classes. The data should be used when learning about ordered logit and simple bar graphs. A regression should not be used. It could be used to compare the levels of trust in different regime types. It would be interesting to compare the results of other authoritarian countries, such as Turkey and Vietnam, to the results of these datasets from China. Additionally, data from these countries could be compared to democracies. People underreport in authoritarian governments and might not always tell the truth, so there is a chance that authoritarian countries could have similar levels of reported trust to the democratic countries. This experiment is also a list experiment, which reduces some of the underreporting. The data can be used to see whether certain demographic characteristics have more or less support for their government. Examples of demographic characteristics that could be looked at are gender, age, and education level.

  10. T

    China Average Yearly Wages

    • tradingeconomics.com
    • de.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Dec 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2024). China Average Yearly Wages [Dataset]. https://tradingeconomics.com/china/wages
    Explore at:
    json, xml, csv, excelAvailable download formats
    Dataset updated
    Dec 15, 2024
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 31, 1952 - Dec 31, 2024
    Area covered
    China
    Description

    Wages in China increased to 120698 CNY/Year in 2023 from 114029 CNY/Year in 2022. This dataset provides - China Average Yearly Wages - actual values, historical data, forecast, chart, statistics, economic calendar and news.

  11. S

    Data from: Information dataset of China’s overseas industrial parks from...

    • scidb.cn
    Updated Jul 4, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    李祜梅; 邬明权; 牛铮; 李旗 (2019). Information dataset of China’s overseas industrial parks from 1992 to 2018 [Dataset]. http://doi.org/10.11922/sciencedb.797
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 4, 2019
    Dataset provided by
    Science Data Bank
    Authors
    李祜梅; 邬明权; 牛铮; 李旗
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    Since China’s initiative of the Belt and Road Initiative, overseas industrial parks have become an important carrier for economic and trade cooperation and have become an important force for Chinese enterprises to go global.However, although there are many industrial parks invested by Chinese companies abroad, there is not yet a comprehensive statistical work that is crucial for national or corporate investors.The start-up time of some parks and the name of Chinese enterprises that are under construction are difficult to find, so comprehensive statistical work is relatively difficult.This paper collects data through the network crawling technology, the public number of the Belt and Road International Industrial Park, the official website of the major enterprises participating in the Belt and Road construction, and the database of the Ministry of Commerce.Under the most comprehensive collection possible, a detailed data set of the China Outland Campus Belt and Road Project from 1992 to 2018 was compiled.This data set summarizes the existing park names and determines the total number of parks currently built in China; statistics on the number of parks on each continent to understand the distribution of the park; then analyze the type of the park, and understand the distribution of resources in the area by type; finally,compare the time between the construction of the park and the time of the country where the park is located join the Asian Infrastructure Investment Bank(AIIB) to know the relationship between the AIIB and the park.

  12. China Agricultural and Economic Data

    • catalog.data.gov
    • datasets.ai
    • +3more
    Updated Apr 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Economic Research Service, Department of Agriculture (2025). China Agricultural and Economic Data [Dataset]. https://catalog.data.gov/dataset/china-agricultural-and-economic-data
    Explore at:
    Dataset updated
    Apr 21, 2025
    Dataset provided by
    Economic Research Servicehttp://www.ers.usda.gov/
    Area covered
    China
    Description

    Note: Updates to this data product are discontinued. The China agricultural and economic database is a collection of agricultural-related data from official statistical publications of the People's Republic of China. Analysts and policy professionals around the world need information about the rapidly changing Chinese economy, but statistics are often published only in China and sometimes only in Chinese-language publications. This product assembles a wide variety of data items covering agricultural production, inputs, prices, food consumption, output of industrial products relevant to the agricultural sector, and macroeconomic data.

  13. Economic driving factors for Chinese population 1949 to 2013

    • springernature.figshare.com
    zip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lizhe Wang; Lajiao Chen (2023). Economic driving factors for Chinese population 1949 to 2013 [Dataset]. http://doi.org/10.6084/m9.figshare.c.3291368_D5.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Lizhe Wang; Lajiao Chen
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The economic factors present in this dataset include data items of gross domestic product (GDP) (100 million), per-capita GDP (yuan/people), primary industry (100 million), secondary industry (100 million), tertiary industry (100 million) and total investment in fixed assets (100 million). Time serial data from 1949 to 2013 of whole China and all the provinces are included. All of data were collected from the China Statistical Yearbook from 1981 to 2014 and China Compendium of Statistics from 1949 to 2008.These data are not intended for demarcation.

  14. T

    China Youth Unemployment Rate

    • tradingeconomics.com
    • ru.tradingeconomics.com
    • +12more
    csv, excel, json, xml
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). China Youth Unemployment Rate [Dataset]. https://tradingeconomics.com/china/youth-unemployment-rate
    Explore at:
    excel, xml, json, csvAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 31, 2021 - May 31, 2025
    Area covered
    China
    Description

    Youth Unemployment Rate in China decreased to 14.90 percent in May from 15.80 percent in April of 2025. This dataset includes a chart with historical data for China Youth Unemployment Rate.

  15. China Multi-Generational Panel Dataset, Liaoning (CMGPD-LN), 1749-1909 -...

    • search.gesis.org
    Updated May 30, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ICPSR - Interuniversity Consortium for Political and Social Research (2021). China Multi-Generational Panel Dataset, Liaoning (CMGPD-LN), 1749-1909 - Version 10 [Dataset]. http://doi.org/10.3886/ICPSR27063.v10
    Explore at:
    Dataset updated
    May 30, 2021
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    GESIS search
    License

    https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de448898https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de448898

    Area covered
    Liaoning, China
    Description

    Abstract (en): The China Multi-Generational Panel Dataset - Liaoning (CMGPD-LN) is drawn from the population registers compiled by the Imperial Household Agency (neiwufu) in Shengjing, currently the northeast Chinese province of Liaoning, between 1749 and 1909. It provides 1.5 million triennial observations of more than 260,000 residents from 698 communities. The population mainly consists of immigrants from North China who settled in rural Liaoning during the early eighteenth century, and their descendants. The data provide socioeconomic, demographic, and other characteristics for individuals, households, and communities, and record demographic outcomes such as marriage, fertility, and mortality. The data also record specific disabilities for a subset of adult males. Additionally, the collection includes monthly and annual grain price data, custom records for the city of Yingkou, as well as information regarding natural disasters, such as floods, droughts, and earthquakes. This dataset is unique among publicly available population databases because of its time span, volume, detail, and completeness of recording, and because it provides longitudinal data not just on individuals, but on their households, descent groups, and communities. Possible applications of the dataset include the study of relationships between demographic behavior, family organization, and socioeconomic status across the life course and across generations, the influence of region and community on demographic outcomes, and development and assessment of quantitative methods for the analysis of complex longitudinal datasets. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Created variable labels and/or value labels.; Standardized missing values.; Created online analysis version with question text.; Checked for undocumented or out-of-range codes.. Smallest Geographic Unit: Chinese banners (8) The data are from 725 surviving triennial registers from 29 distinct populations. Each of the 29 register series corresponded to a specific rural population concentrated in a small number of neighboring villages. These populations were affiliated with the Eight Banner civil and military administration that the Qing state used to govern northeast China as well as some other parts of the country. 16 of the 29 populations are regular bannermen. In these populations adult males had generous allocations of land from the state, and in return paid an annual fixed tax to the Imperial Household Agency, and provided to the Imperial Household Agency such home products as homespun fabric and preserved meat, and/or such forest products as mushrooms. In addition, as regular bannermen they were liable for military service as artisans and soldiers which, while in theory an obligation, was actually an important source of personal revenue and therefore a political privilege. 8 of the 29 populations are special duty banner populations. As in the regular banner population, the adult males in the special duty banner populations also enjoyed state allocated land free of rent. These adult males were also assigned to provide special services, including collecting honey, raising bees, fishing, picking cotton, and tanning and dyeing. The remaining populations were a diverse mixture of estate banner and servile populations. The populations covered by the registers, like much of the population of rural Liaoning in the eighteenth and nineteenth centuries, were mostly descendants of Han Chinese settlers who came from Shandong and other nearby provinces in the late seventeenth and early eighteenth centuries in response to an effort by the Chinese state to repopulate the region. 2016-09-06 2016-09-06 The Training Guide has been updated to version 3.60. Additionally, the Principal Investigator affiliation has been corrected, and cover sheets for all PDF documents have been revised.2014-07-10 Releasing new study level documentation that contains the tables found in the appendix of the Analytic dataset codebook.2014-06-10 The data and documentation have been updated following re-evaluation.2014-01-29 Fixing variable format issues. Some variables that were supposed to be s...

  16. Chinese Domestic Databases Market Report | Global Forecast From 2025 To 2033...

    • dataintelo.com
    csv, pdf, pptx
    Updated Oct 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2024). Chinese Domestic Databases Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/chinese-domestic-databases-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Oct 16, 2024
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global, China
    Description

    Chinese Domestic Databases Market Outlook



    The Chinese Domestic Databases market size is set for robust growth, projected to grow from USD 2 billion in 2023 to USD 6.5 billion by 2032, reflecting an impressive CAGR of 13.5%. This growth is driven by the increasing demand for data sovereignty, technological advancements, and regulatory support from the Chinese government. The market is primed for expansion, propelled by factors such as the burgeoning digital economy, increased cloud adoption, and the strategic focus on indigenous technological advancements.



    One of the primary growth factors for the Chinese Domestic Databases market is the increasing emphasis on data sovereignty and security. With the Chinese government imposing stringent regulations on data storage and management, domestic companies are compelled to utilize local databases to ensure compliance. This has created a favorable environment for the growth of domestic database providers who are tailored to meet these unique requirements. Additionally, the rise in cyber threats has further driven the need for secure and reliable database solutions, contributing significantly to market growth.



    Technological advancements and innovation within the database industry are also pivotal growth drivers. The rapid development of Artificial Intelligence (AI) and Machine Learning (ML) technologies has allowed for more efficient and intelligent database management systems. Innovations in data handling, processing speed, and storage capabilities provide a significant competitive edge to domestic databases over international counterparts. Furthermore, the integration of AI and ML with databases enables advanced analytics and insights, helping businesses make more informed decisions, thus driving the market forward.



    The digital transformation across various sectors in China has also fueled the demand for robust database solutions. Sectors such as finance, healthcare, and retail are increasingly relying on digital platforms for their operations, necessitating sophisticated and reliable databases to manage vast amounts of data. The push towards a digital economy by the Chinese government, coupled with initiatives like the "New Infrastructure" program, which focuses on the development of digital infrastructure including big data centers, has significantly boosted the demand for domestic databases.



    Regionally, East China dominates the market due to the presence of major economic hubs like Shanghai and Hangzhou, which are home to numerous technology companies and data centers. North China, with Beijing as its central hub, also plays a significant role in the market due to the concentration of governmental bodies and financial institutions that demand secure and compliant database solutions. South China, particularly Shenzhen, is another critical region, given its prominence as a technology and innovation hub. Central China and other regions are gradually catching up as investments in digital infrastructure spread across the country. Overall, the regional dynamics of the Chinese Domestic Databases market present a diverse and rapidly evolving landscape.



    Type Analysis



    The Chinese Domestic Databases market comprises various types, including Relational Databases, NoSQL Databases, NewSQL Databases, and others. Relational Databases have been the cornerstone of the database industry for decades, offering structured data storage and easy retrieval through SQL queries. Despite their age, they remain highly relevant due to their robustness, reliability, and the vast ecosystems that have developed around them. In China, relational databases continue to be widely adopted across various industries, particularly in sectors like finance and government, where data accuracy and consistency are paramount.



    NoSQL Databases have gained significant traction in recent years due to their flexibility, scalability, and ability to handle unstructured data. Unlike traditional relational databases, NoSQL databases can seamlessly manage large volumes of diverse data types, making them ideal for applications in big data and real-time web applications. In China, the adoption of NoSQL databases is particularly prominent in the e-commerce and social media sectors, where the ability to scale out horizontally and handle high-velocity data is crucial.



    NewSQL Databases represent a hybrid approach that combines the best features of traditional relational databases and NoSQL databases. They offer the scalability and flexibility of NoSQL while maintaining the ACID (Atomicity, Consistency, Isolation, Durability) prope

  17. m

    The Motivations for Fashion Shopping in China (SPSS Dataset)

    • data.mendeley.com
    Updated Jul 2, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christopher J. Parker (2018). The Motivations for Fashion Shopping in China (SPSS Dataset) [Dataset]. http://doi.org/10.17632/bzn593sv5d.1
    Explore at:
    Dataset updated
    Jul 2, 2018
    Authors
    Christopher J. Parker
    License

    Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
    License information was derived automatically

    Area covered
    China
    Description

    In this study, 403 Chinese consumers generalizable to the broader population were surveyed on their motivations to shop for fashion apparel in both high street and e-commerce environments. Statistical analysis was undertaken through multiple T-Tests and MANOVA with the assistance of SPSS and G*Power.

    To increase the profits of international brands, this paper presents the motivations of Chinese consumers to engage in fashion retail, building upon established theory in hedonic and utilitarian motivations. With China set to capture over 24% of the $212 billion fashion market, international brands need to understand the unique motivations of Chinese consumers in order to capitalise on the market. However, the motivations of Chinese people to engage in fashion retail are as yet undefined, limiting the ability for international fashion retailers to operate with prosperity in the Chinese market.

  18. S

    Chinese Natural Speech Complex Emotion Dataset

    • scidb.cn
    Updated Feb 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaolong Wu; Mingxing Xu; Askar Hamdulla; Thomas Fang Zheng (2025). Chinese Natural Speech Complex Emotion Dataset [Dataset]. http://doi.org/10.57760/sciencedb.20968
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 24, 2025
    Dataset provided by
    Science Data Bank
    Authors
    Xiaolong Wu; Mingxing Xu; Askar Hamdulla; Thomas Fang Zheng
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Although Chinese speech affective computing has received increasing attention, existing datasets still have defects such as lack of naturalness, single pronunciation style, and unreliable annotation, which seriously hinder the research in this field. To address these issues, this paper introduces the first Chinese Natural Speech Complex Emotion Dataset (CNSCED) to provide natural data resources for Chinese speech affective computing. CNSCED was collected from publicly broadcasted civil dispute and interview television programs in China, reflecting the authentic emotional characteristics of Chinese people in daily life. The dataset includes 14 hours of speech data from 454 speakers of various ages, totaling 15777 samples. Based on the inherent complexity and ambiguity of natural emotions, this paper proposes an emotion vector annotation method. This method utilizes a vector composed of six meta-emotional dimensions (angry, sad, aroused, happy, surprise, and fear) of different intensities to describe any single or complex emotion. The CNSCED released two subtasks: complex emotion classification and complex emotion intensity regression. In the experimental section, we evaluated the CNSCED dataset using deep neural network models and provided a baseline result. To the best of our knowledge, CNSCED is the first public Chinese natural speech complex emotion dataset, which can be used for scientific research free of charge.

  19. h

    Supporting Data for "Essays on Chinese Ethnicity and The Rise of China:...

    • datahub.hku.hk
    Updated Apr 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Attawat Assavanadda (2025). Supporting Data for "Essays on Chinese Ethnicity and The Rise of China: Insights from Thailand" [Dataset]. http://doi.org/10.25442/hku.28321739.v1
    Explore at:
    Dataset updated
    Apr 30, 2025
    Dataset provided by
    HKU Data Repository
    Authors
    Attawat Assavanadda
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Area covered
    Thailand, China
    Description

    These datasets contain supporting data for "Essays on Chinese Ethnicity and Attitudes towards China: Insights from Thailand."These datasets are divided into three groups.Interview records and verbatim transcriptsField notesSurvey dataThe first group was obtained from in-person interviews I conducted in Bangkok, Thailand, from July to August 2024. The total number of interviews is 46, comprising 37 ethnic Chinese Thais and nine non-ethnic Chinese Thais. On average, each interview lasts 25 minutes. The verbatim transcripts, formatted as document files, contain the exact words of the interviews, with personal identifiers of both the interviewer and interviewee filtered out.The second group was also obtained during the interviews. In addition to audio recording, I jotted down some notes I learnt from the interviewee. All notes, except for two, were created digitally by typing on the Qualtrics platform and later downloaded as a readable dataset. The remaining two notes were created digitally by typing on Microsoft Word and saved as PDF files. The third group was obtained online and was further divided into two dub groups. The first sub-group, i.e., the pilot batch, was collected in March 2025 and comprises 110 responses. Each response contains demographic information, attitudes towards foreign countries, and reactions to a foreign country's public diplomacy efforts. The second sub-group, i.e., the full batch, was collected in April 2025 and comprises 1118 responses. Each response contains the same information as that of the pilot batch mentioned above. Please refer to HKU_DataSet_README.txt for further details.

  20. m

    Data from: A dataset of dynamical social map in ancient China: 618-1644

    • data.mendeley.com
    Updated Sep 30, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiongfei Jiang (2022). A dataset of dynamical social map in ancient China: 618-1644 [Dataset]. http://doi.org/10.17632/vjyh3g8w2r.2
    Explore at:
    Dataset updated
    Sep 30, 2022
    Authors
    Xiongfei Jiang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data set of this article is related to the paper "Dynamical structure of social map in ancient China" (2022, Physica A, https://doi.org/10.1016/j.physa.2022.128209) . This article demonstrates the data of social relations between cities in ancient China, ranging from 618 AD to 1644 AD. The raw data of social associations between elites used to build social maps are extracted from the China Biographical Database. The raw data contain 14610 elites and 29673 social associations, which cover 366 cities in China. The dataset of this article is relevant both for social and natural scientists interested in the social and economic history of ancient China. The data can be used for further insights/analyses on the evolutionary pattern of geo-social architecture, and the geo-history from the viewpoint of social network.

    The dataset contains $3$ files: "Networks.xlsx", "Coordinates.xlsx", and "SocialMap.html". The "Networks.xlsx" has 3 columns, representing the source node (city), target node (city), and weight of a link between two nodes, respectively. The "Networks.xlsx" contains $9$ sheets, which are the data for different dynasties named by Early Tang, Late Tang, Early Northern-Song, Late Northern-Song, Early Southern-Song, Late Southern-Song, Yuan, Early Ming, and Late Ming. Noticeably, the "Networks.xlsx" can be visualized by the network software of Gephi directly. The "Coordinates.xlsx" has 4 columns storing longitude and latitude for all cities that appeared in 9 networks. The first and second columns are English names and Chinese names of cities; the third and fourth columns are longitudes and latitudes of cities. The "SocialMap.html" provides a visualization platform, in which users could select and illustrate the evolution of social maps intuitively.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Lizhe Wang; Lajiao Chen (2023). Spatiotemporal data on Chinese population distribution from 1949 to 2013 [Dataset]. http://doi.org/10.6084/m9.figshare.c.3291368_D2.v1
Organization logoOrganization logo

Spatiotemporal data on Chinese population distribution from 1949 to 2013

Related Article
Explore at:
zipAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Lizhe Wang; Lajiao Chen
License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Description

Population: This dataset contains 65-years’ time serial data of whole China (unit: million persons), each provinces (unit: 10000 persons), and each county. The source data are originally collected from China Statistical Yearbook from 1949 to 2013. The county data covers 2000, 2006, 2007, and 2009. In addition, 4 years (1995, 2000, 2005, 2010) population distributions cover the whole land region in China are also included in this dataset. Such data is expressed as raster format with 1 km resolution and a projection of Albers. Attribute information mainly includes population density (unit: number of person per square kilometer). The source data are originally provided by Data Center for Resources and Environmental Sciences, Chinese Academy of Sciences (RESDC) (http://www.resdc.cn) and Data Sharing Infrastructure of Earth System Science (http://www.geodata.cn).These data are not intended for demarcation.

Search
Clear search
Close search
Google apps
Main menu