95 datasets found
  1. Z

    Modern China Geospatial Database - Main Dataset

    • data.niaid.nih.gov
    • zenodo.org
    Updated Feb 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian Henriot (2025). Modern China Geospatial Database - Main Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5735393
    Explore at:
    Dataset updated
    Feb 28, 2025
    Dataset authored and provided by
    Christian Henriot
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    MCGD_Data_V2.2 contains all the data that we have collected on locations in modern China, plus a number of locations outside of China that we encounter frequently in historical sources on China. All further updates will appear under the name "MCGD_Data" with a time stamp (e.g., MCGD_Data2023-06-21)

    You can also have access to this dataset and all the datasets that the ENP-China makes available on GitLab: https://gitlab.com/enpchina/IndexesEnp

    Altogether there are 464,970 entries. The data include the name of locations and their variants in Chinese, pinyin, and any recorded transliteration; the name of the province in Chinese and in pinyin; Province ID; the latitude and longitude; the Name ID and Location ID, and NameID_Legacy. The Name IDs all start with H followed by seven digits. This is the internal ID system of MCGD (the NameID_Legacy column records the Name IDs in their original format depending on the source). Locations IDs that start with "DH" are data points extracted from China Historical GIS (Harvard University); those that start with "D" are locations extracted from the data points in Geonames; those that have only digits (8 digits) are data points we have added from various map sources.

    One of the main features of the MCGD Main Dataset is the systematic collection and compilation of place names from non-Chinese language historical sources. Locations were designated in transliteration systems that are hardly comprehensible today, which makes it very difficult to find the actual locations they correspond to. This dataset allows for the conversion from these obsolete transliterations to the current names and geocoordinates.

    From June 2021 onward, we have adopted a different file naming system to keep track of versions. From MCGD_Data_V1 we have moved to MCGD_Data_V2. In June 2022, we introduced time stamps, which result in the following naming convention: MCGD_Data_YYYY.MM.DD.

    UPDATES

    MCGD_Data2025_02_28 includes a major change with the duplication of all the locations listed under Beijing, Shanghai, Tianjin, and Chongqing (北京, 上海, 天津, 重慶) and their listing under the name of the provinces to which they belonge origially before the creation of the four special municipalities after 1949. This is meant to facilitate the matching of data from historical sources. Each location has a unique NameID. Altogether there are 472,818 entries

    MCGD_Data2025_02_27 inclues an update on locations extracted from Minguo zhengfu ge yuanhui keyuan yishang zhiyuanlu 國民政府各院部會科員以上職員錄 (Directory of staff members and above in the ministries and committees of the National Government). Nanjing: Guomin zhengfu wenguanchu yinzhuju 國民政府文官處印鑄局國民政府文官處印鑄局, 1944). We also made corrections in the Prov_Py and Prov_Zh columns as there were some misalignments between the pinyin name and the name in Chines characters. The file now includes 465,128 entries.

    MCGD_Data2024_03_23 includes an update on locations in Taiwan from the Asia Directories. Altogether there are 465,603 entries (of which 187 place names without geocoordinates, labelled in the Lat Long columns as "Unknown").

    MCGD_Data2023.12.22 contains all the data that we have collected on locations in China, whatever the period. Altogether there are 465,603 entries (of which 187 place names without geocoordinates, labelled in the Lat Long columns as "Unknown"). The dataset also includes locations outside of China for the purpose of matching such locations to the place names extracted from historical sources. For example, one may need to locate individuals born outside of China. Rather than maintaining two separate files, we made the decision to incorporate all the place names found in historical sources in the gazetteer. Such place names can easily be removed by selecting all the entries where the 'Province' data is missing.

  2. S

    Data from: Information dataset of China’s overseas industrial parks from...

    • scidb.cn
    Updated Jul 4, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    李祜梅; 邬明权; 牛铮; 李旗 (2019). Information dataset of China’s overseas industrial parks from 1992 to 2018 [Dataset]. http://doi.org/10.11922/sciencedb.797
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 4, 2019
    Dataset provided by
    Science Data Bank
    Authors
    李祜梅; 邬明权; 牛铮; 李旗
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    Since China’s initiative of the Belt and Road Initiative, overseas industrial parks have become an important carrier for economic and trade cooperation and have become an important force for Chinese enterprises to go global.However, although there are many industrial parks invested by Chinese companies abroad, there is not yet a comprehensive statistical work that is crucial for national or corporate investors.The start-up time of some parks and the name of Chinese enterprises that are under construction are difficult to find, so comprehensive statistical work is relatively difficult.This paper collects data through the network crawling technology, the public number of the Belt and Road International Industrial Park, the official website of the major enterprises participating in the Belt and Road construction, and the database of the Ministry of Commerce.Under the most comprehensive collection possible, a detailed data set of the China Outland Campus Belt and Road Project from 1992 to 2018 was compiled.This data set summarizes the existing park names and determines the total number of parks currently built in China; statistics on the number of parks on each continent to understand the distribution of the park; then analyze the type of the park, and understand the distribution of resources in the area by type; finally,compare the time between the construction of the park and the time of the country where the park is located join the Asian Infrastructure Investment Bank(AIIB) to know the relationship between the AIIB and the park.

  3. T

    China Unemployment Rate

    • tradingeconomics.com
    • jp.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Jul 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). China Unemployment Rate [Dataset]. https://tradingeconomics.com/china/unemployment-rate
    Explore at:
    csv, xml, excel, jsonAvailable download formats
    Dataset updated
    Jul 11, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Sep 30, 2002 - May 31, 2025
    Area covered
    China
    Description

    Unemployment Rate in China decreased to 5 percent in May from 5.10 percent in April of 2025. This dataset provides - China Unemployment Rate - actual values, historical data, forecast, chart, statistics, economic calendar and news.

  4. T

    Chinese Yuan Data

    • tradingeconomics.com
    • jp.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Jan 4, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2017). Chinese Yuan Data [Dataset]. https://tradingeconomics.com/china/currency
    Explore at:
    xml, csv, excel, jsonAvailable download formats
    Dataset updated
    Jan 4, 2017
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 2, 1981 - Jul 11, 2025
    Area covered
    China
    Description

    The USD/CNY exchange rate fell to 7.1727 on July 11, 2025, down 0.09% from the previous session. Over the past month, the Chinese Yuan has strengthened 0.01%, and is up by 1.35% over the last 12 months. Chinese Yuan - values, historical data, forecasts and news - updated on July of 2025.

  5. S

    Data from: A standardized dataset of built-up areas of China’s cities with...

    • scidb.cn
    Updated Jul 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiang Huiping; Sun Zhongchang; Guo Huadong; Du Wenjie; Xing Qiang; Cai Guoyin (2021). A standardized dataset of built-up areas of China’s cities with populations over 300,000 for the period 1990–2015 [Dataset]. http://doi.org/10.11922/sciencedb.j00076.00004
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 7, 2021
    Dataset provided by
    Science Data Bank
    Authors
    Jiang Huiping; Sun Zhongchang; Guo Huadong; Du Wenjie; Xing Qiang; Cai Guoyin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    Here we used remote sensing data from multiple sources (time-series of Landsat and Sentinel images) to map the impervious surface area (ISA) at five-year intervals from 1990 to 2015, and then converted the results into a standardized dataset of the built-up area for 433 Chinese cities with 300,000 inhabitants or more, which were listed in the United Nations (UN) World Urbanization Prospects (WUP) database (including Mainland China, Hong Kong, Macao and Taiwan). We employed a range of spectral indices to generate the 1990–2015 ISA maps in urban areas based on remotely sensed data acquired from multiple sources. In this process, various types of auxiliary data were used to create the desired products for urban areas through manual segmentation of peri-urban and rural areas together with reference to several freely available products of urban extent derived from ISA data using automated urban–rural segmentation methods. After that, following the well-established rules adopted by the UN, we carried out the conversion to the standardized built-up area products from the 1990–2015 ISA maps in urban areas, which conformed to the definition of urban agglomeration area (UAA). Finally, we implemented data postprocessing to guarantee the spatial accuracy and temporal consistency of the final product.The standardized urban built-up area dataset (SUBAD–China) introduced here is the first product using the same definition of UAA adopted by the WUP database for 433 county and higher-level cities in China. The comparisons made with contemporary data produced by the National Bureau of Statistics of China, the World Bank and UN-habitat indicate that our results have a high spatial accuracy and good temporal consistency and thus can be used to characterize the process of urban expansion in China.The SUBAD–China contains 2,598 vector files in shapefile format containing data for all China's cities listed in the WUP database that have different urban sizes and income levels with populations over 300,000. Attached with it, we also provided the distribution of validation points for the 1990–2010 ISA products of these 433 Chinese cities in shapefile format and the confusion matrices between classified data and reference data during different time periods as a Microsoft Excel Open XML Spreadsheet (XLSX) file.Furthermore, The standardized built-up area products for such cities will be consistently updated and refined to ensure the quality of their spatiotemporal coverage and accuracy. The production of this dataset together with the usage of population counts derived from the WUP database will close some of the data gaps in the calculation of SDG11.3.1 and benefit other downstream applications relevant to a combined analysis of the spatial and socio-economic domains in urban areas.

  6. F

    Mandarin General Conversation Speech Dataset for ASR

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Mandarin General Conversation Speech Dataset for ASR [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/general-conversation-mandarin-china
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    Welcome to the Mandarin Chinese General Conversation Speech Dataset — a rich, linguistically diverse corpus purpose-built to accelerate the development of Mandarin speech technologies. This dataset is designed to train and fine-tune ASR systems, spoken language understanding models, and generative voice AI tailored to real-world Mandarin Chinese communication.

    Curated by FutureBeeAI, this 30 hours dataset offers unscripted, spontaneous two-speaker conversations across a wide array of real-life topics. It enables researchers, AI developers, and voice-first product teams to build robust, production-grade Mandarin speech models that understand and respond to authentic Chinese accents and dialects.

    Speech Data

    The dataset comprises 30 hours of high-quality audio, featuring natural, free-flowing dialogue between native speakers of Mandarin Chinese. These sessions range from informal daily talks to deeper, topic-specific discussions, ensuring variability and context richness for diverse use cases.

    Participant Diversity:
    Speakers: 60 verified native Mandarin Chinese speakers from FutureBeeAI’s contributor community.
    Regions: Representing various provinces of China to ensure dialectal diversity and demographic balance.
    Demographics: A balanced gender ratio (60% male, 40% female) with participant ages ranging from 18 to 70 years.
    Recording Details:
    Conversation Style: Unscripted, spontaneous peer-to-peer dialogues.
    Duration: Each conversation ranges from 15 to 60 minutes.
    Audio Format: Stereo WAV files, 16-bit depth, recorded at 16kHz sample rate.
    Environment: Quiet, echo-free settings with no background noise.

    Topic Diversity

    The dataset spans a wide variety of everyday and domain-relevant themes. This topic diversity ensures the resulting models are adaptable to broad speech contexts.

    Sample Topics Include:
    Family & Relationships
    Food & Recipes
    Education & Career
    Healthcare Discussions
    Social Issues
    Technology & Gadgets
    Travel & Local Culture
    Shopping & Marketplace Experiences, and many more.

    Transcription

    Each audio file is paired with a human-verified, verbatim transcription available in JSON format.

    Transcription Highlights:
    Speaker-segmented dialogues
    Time-coded utterances
    Non-speech elements (pauses, laughter, etc.)
    High transcription accuracy, achieved through double QA pass, average WER < 5%

    These transcriptions are production-ready, enabling seamless integration into ASR model pipelines or conversational AI workflows.

    Metadata

    The dataset comes with granular metadata for both speakers and recordings:

    Speaker Metadata: Age, gender, accent, dialect, state/province, and participant ID.
    Recording Metadata: Topic, duration, audio format, device type, and sample rate.

    Such metadata helps developers fine-tune model training and supports use-case-specific filtering or demographic analysis.

    Usage and Applications

    This dataset is a versatile resource for multiple Mandarin speech and language AI applications:

    ASR Development: Train accurate speech-to-text systems for Mandarin Chinese.
    Voice Assistants: Build smart assistants capable of understanding natural Chinese conversations.
    <div style="margin-top:10px; margin-bottom: 10px; padding-left: 30px; display: flex; gap: 16px;

  7. A

    China - Subnational Administrative Boundaries

    • data.amerigeoss.org
    • data.humdata.org
    emf, geodatabase, shp +1
    Updated Jul 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UN Humanitarian Data Exchange (2025). China - Subnational Administrative Boundaries [Dataset]. https://data.amerigeoss.org/dataset/china-administrative-boundaries
    Explore at:
    shp(3635723), xlsx(39574), geodatabase(798645), emf(858116)Available download formats
    Dataset updated
    Jul 1, 2025
    Dataset provided by
    UN Humanitarian Data Exchange
    Area covered
    China
    Description

    China administrative level 0-2 boundaries (COD-AB) dataset.

    The date that these administrative boundaries were established is unknown.

    NOTE: COD-AB source is unofficial. COD-AB reflects Chinese and UN recognition of 'Taiwan' as a Chinese ADM1 feature.

    This COD-AB was most recently reviewed for accuracy and necessary changes in October 2024. The COD-AB does not require any update.

    Sourced from publicly available online sources

    Vetting by Information Technology Outreach Services (ITOS) with funding from USAID.

    There is no suitable population statistics dataset (COD-PS) for linkage to this COD-AB..

    No edge-matched (COD-EM) version of this COD-AB has yet been prepared.

    Please see the COD Portal.

    Administrative level 1 contains 34 feature(s). The normal administrative level 1 feature type is ""currently not known"".

    Administrative level 2 contains 361 feature(s). The normal administrative level 2 feature type is ""currently not known"".

    "China administrative level 0-2 boundaries (COD-AB) dataset.

    The date that these administrative boundaries were established is unknown.

    NOTE: COD-AB source is unofficial. COD-AB reflects Chinese and UN recognition of 'Taiwan' as a Chinese ADM1 feature.

    This COD-AB was most recently reviewed for accuracy and necessary changes in October 2024. The COD-AB does not require any update.

    Sourced from publicly available online sources

    Vetting by Information Technology Outreach Services (ITOS) with funding from USAID.

    There is no suitable population statistics dataset (COD-PS) for linkage to this COD-AB..

    No edge-matched (COD-EM) version of this COD-AB has yet been prepared.

    Please see the COD Portal.

    Administrative level 1 contains 34 feature(s). The normal administrative level 1 feature type is 'currently not known'.

    Administrative level 2 contains 361 feature(s). The normal administrative level 2 feature type is 'currently not known'.

    Recommended cartographic projection: Asia South Albers Equal Area Conic

    This metadata was last updated on January 18, 2025.

  8. T

    China Average Yearly Wages

    • tradingeconomics.com
    • de.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). China Average Yearly Wages [Dataset]. https://tradingeconomics.com/china/wages
    Explore at:
    json, xml, csv, excelAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 31, 1952 - Dec 31, 2024
    Area covered
    China
    Description

    Wages in China increased to 120698 CNY/Year in 2023 from 114029 CNY/Year in 2022. This dataset provides - China Average Yearly Wages - actual values, historical data, forecast, chart, statistics, economic calendar and news.

  9. S

    Chinese Human Connectome Project

    • scidb.cn
    Updated Dec 3, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guoyuan Yang; Jianqiao Ge; Jia-Hong Gao (2022). Chinese Human Connectome Project [Dataset]. http://doi.org/10.11922/sciencedb.01374
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 3, 2022
    Dataset provided by
    Science Data Bank
    Authors
    Guoyuan Yang; Jianqiao Ge; Jia-Hong Gao
    Description

    CHCP Overview:The human behavior and brain are shaped by genetic, environmental and cultural interactions. Recent advances in neuroimaging integrate multimodal imaging data from a large population and start to explore the large-scale structural and functional connectomic architectures of the human brain. One of the major pioneers is the Human Connectome Project (HCP) that developed sophisticated imaging protocols and has built a collection of high-quality multimodal neuroimaging, behavioral and genetic data from US population. A large-scale neuroimaging project parallel to the HCP, but with a focus on the East Asian population, will allow comparisons of brain-behavior associations across different ethnicities and cultures. The Chinese Human Connectome Project (CHCP) is launched in 2017 and led by Professor Jia-Hong GAO at Peking University, Beijing, China. CHCP aims to provide large sets of multimodal neuroimaging, behavioral and genetic data on the Chinese population that are comparable to the data of the HCP. The CHCP protocols were almost identical to those of the HCP, including the procedure for 3T MRI scanning, the data acquisition parameters, and the task paradigms for functional brain imaging. The CHCP also collected behavioral and genetic data that were compatible with the HCP dataset. The first public release of the CHCP dataset is in 2022. CHCP dataset includes high-resolution structural MR images (T1W and T2W), resting-state fMRI (rfMRI), task fMRI (tfMRI), and high angular resolution diffusion MR images (dMRI) of the human brain as well as behavioral data based on Chinese population. The unprocessed "raw" images of CHCP dataset (about 1.85 TB) have been released on the platform and can be downloaded. Considering our current cloud-storage service, sharing full preprocessed images (up to 70 TB) requires further construction. We will be actively cooperating with researchers who contact us for academic request, offering case-by-case solution to access the preprocessed data in a timely manner, such as by mailing hard disks or a third-party trusted cloud-storage service. V2 Release (Date: January 16, 2023):Here, we released the seven major domains task fMRI EVs files, including: 1) visual, motion, somatosensory, and motor systems; 2) category specific representations; 3) working memory/cognitive control systems; 4) language processing (semantic and phonological processing); 5) social cognition (Theory of Mind); 6) relational processing; and 7) emotion processing.V3 Release (Date: January 12, 2024):This version of data release primarily discloses the CHCP raw MRI dataset that underwent “HCP minimal preprocessing pipeline”, located in CHCP_ScienceDB_preproc folder (about 6.90 TB). In this folder, preprocessed MRI data includes T1W, T2W, rfMRI, tfMRI, and dMRI modalities for all young adulthood participants, as well as partial results for middle-aged and older adulthood participants in the CHCP dataset. Following the data sharing strategy of the HCP, we have eliminated some redundant preprocessed data, resulting in a final total size of the preprocessed CHCP dataset is about 6.90 TB in zip files. V4 Release (Date: December 4, 2024):In this update, we have fixed the issue with the corrupted compressed file of preprocessed data for subject 3011, and removed the incorrect preprocessed results for subject 3090. Additionally, we have updated the subject file information list. Additionally, this release includes the update of unprocessed "raw" images of the CHCP dataset in CHCP_ScienceDB_unpreproc folder (about 1.85 TB), addressing the previously insufficient anonymization of T1W and T2W modalities data for some older adulthood participants in versions V1 and V2. For more detailed information, please refer to the data descriptions in versions V1 and V2.CHCP Summary:Subjects:366 healthy adults (Chinese Han)Imaging Scanner:3T MR (Siemens Prisma)Institution:Peking University, Beijing, ChinaFunding Agencies:Beijing Municipal Science & Technology CommissionChinese Institute for Brain Research (Beijing)National Natural Science Foundation of ChinaMinistry of Science and Technology of China CHCP Citations:Papers, book chapters, books, posters, oral presentations, and all other printed and digital presentations of results derived from CHCP data should contain the following wording in the acknowledgments section: "Data were provided [in part] by the Chinese Human Connectome Project (CHCP, PI: Jia-Hong Gao) funded by the Beijing Municipal Science & Technology Commission, Chinese Institute for Brain Research (Beijing), National Natural Science Foundation of China, and the Ministry of Science and Technology of China."

  10. a

    Chinese Public Diplomacy activities

    • aiddata.org
    Updated Dec 13, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Chinese Public Diplomacy activities [Dataset]. https://www.aiddata.org/data/chinas-public-diplomacy-dashboard-dataset-version-2
    Explore at:
    Dataset updated
    Dec 13, 2021
    Area covered
    China
    Description

    This dataset contains first-of-its-kind quantifiable data on China’s public diplomacy efforts from five of AidData’s reports—Ties That Bind, Influencing the Narrative, Silk Road Diplomacy, Listening to Leaders 2021, and Corridors of Power—that is available through AidData’s China’s Public Diplomacy Dashboard. This includes metrics for five types of public diplomacy: financial, cultural, elite-to-elite, exchange, and informational.

  11. China Overseas Finance Inventory Database - Datasets - Data | World...

    • old-datasets.wri.org
    Updated Mar 21, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wri.org (2022). China Overseas Finance Inventory Database - Datasets - Data | World Resources Institute [Dataset]. https://old-datasets.wri.org/dataset/cofi
    Explore at:
    Dataset updated
    Mar 21, 2022
    Dataset provided by
    World Resources Institutehttps://www.wri.org/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    The COFI database includes power-generation projects in Belt and Road Initiative (BRI) countries financed by Chinese corporations and banks that reached financial closure from 2000 to 2023. Types of financing include debt and equity investment, with the latter including greenfield foreign direct investments (FDI) and cross-border mergers and acquisitions (M&As). COFI is consolidated using nine source databases using both automated join method in R Studio, and manual joining by analysts. The database includes power plant characteristics data and investment detail data. It captures 575 power plants in 87 BRI countries, including 314 equity investment transactions and 341 debt investment transactions made by Chinese investors. Key data points for financial transactions in COFI include the financial instrument (equity or debt), investor name, amount, and financial close year. Key technical characteristics tracked for projects in COFI include name, installed capacity, commissioning year, country, and primary fuel type. This project is a collaboration among the Boston University Global Development Policy Center, the Inter-American Dialogue, the China-Africa Research Initiative at the Johns Hopkins University (CARI), and the World Resources Institute (WRI). The detailed methodology is given in the World Resources Institute publication “China Overseas Finance Inventory”. Cautions When analyzing debt investment amounts, users should be aware of the difference between loan commitment and actual disbursement. Our database records the loan commitment for a certain year and not actual disbursement. The investment amount should only provide a rough picture of where Chinese companies are investing and not how much their exact portion is. In this version of the database, all equity investment amounts are missing. This is because the equity amount is either missing or estimated in the source databases. Citation

  12. Modern China Geospatial Database - Main Dataset

    • zenodo.org
    bin, csv
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian Henriot; Christian Henriot (2023). Modern China Geospatial Database - Main Dataset [Dataset]. http://doi.org/10.5281/zenodo.5866104
    Explore at:
    csv, binAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Christian Henriot; Christian Henriot
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    MCGD_Data_V2.1 contains all the data that we have collected on locations in modern China. Altogether there are 464,887 entries. The data include the name of locations and their variants in Chinese, pinyin, and any recorded transliteration; the name of the province in Chinese and in pinyin; Province ID; the latitude and longitude; the Name ID and Location ID, and NameID_Legacy. The Name IDs all start with H followed by seven digits. This is the internal ID system of MCGD (the NameID_Legacy column records the Name IDs in their original format depending on the source). Locations IDs that start with "DH" are data points extracted from China Historical GIS (Harvard University); those that start with "D" are locations extracted from the data points in Geonames; those that have only digits (8 digits) are data points we have added from various map sources.

    One of the main features of the MCGD Main Dataset is the systematic collection and compilation of place names from non-Chinese language historical sources. Locations were designated in transliteration systems that are hardly comprehensible today, which makes it very difficult to find the actual locations they correspond to. This dataset allows for the conversion from these obsolete transliterations to the current names and geocoordinates.

  13. T

    China Consumer Confidence

    • tradingeconomics.com
    • fa.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS, China Consumer Confidence [Dataset]. https://tradingeconomics.com/china/consumer-confidence
    Explore at:
    excel, xml, json, csvAvailable download formats
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 31, 1991 - May 31, 2025
    Area covered
    China
    Description

    Consumer Confidence in China increased to 88 points in May from 87.80 points in April of 2025. This dataset provides - China Consumer Confidence - actual values, historical data, forecast, chart, statistics, economic calendar and news.

  14. Chinese Domestic Databases Market Report | Global Forecast From 2025 To 2033...

    • dataintelo.com
    csv, pdf, pptx
    Updated Oct 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2024). Chinese Domestic Databases Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/chinese-domestic-databases-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Oct 16, 2024
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global, China
    Description

    Chinese Domestic Databases Market Outlook



    The Chinese Domestic Databases market size is set for robust growth, projected to grow from USD 2 billion in 2023 to USD 6.5 billion by 2032, reflecting an impressive CAGR of 13.5%. This growth is driven by the increasing demand for data sovereignty, technological advancements, and regulatory support from the Chinese government. The market is primed for expansion, propelled by factors such as the burgeoning digital economy, increased cloud adoption, and the strategic focus on indigenous technological advancements.



    One of the primary growth factors for the Chinese Domestic Databases market is the increasing emphasis on data sovereignty and security. With the Chinese government imposing stringent regulations on data storage and management, domestic companies are compelled to utilize local databases to ensure compliance. This has created a favorable environment for the growth of domestic database providers who are tailored to meet these unique requirements. Additionally, the rise in cyber threats has further driven the need for secure and reliable database solutions, contributing significantly to market growth.



    Technological advancements and innovation within the database industry are also pivotal growth drivers. The rapid development of Artificial Intelligence (AI) and Machine Learning (ML) technologies has allowed for more efficient and intelligent database management systems. Innovations in data handling, processing speed, and storage capabilities provide a significant competitive edge to domestic databases over international counterparts. Furthermore, the integration of AI and ML with databases enables advanced analytics and insights, helping businesses make more informed decisions, thus driving the market forward.



    The digital transformation across various sectors in China has also fueled the demand for robust database solutions. Sectors such as finance, healthcare, and retail are increasingly relying on digital platforms for their operations, necessitating sophisticated and reliable databases to manage vast amounts of data. The push towards a digital economy by the Chinese government, coupled with initiatives like the "New Infrastructure" program, which focuses on the development of digital infrastructure including big data centers, has significantly boosted the demand for domestic databases.



    Regionally, East China dominates the market due to the presence of major economic hubs like Shanghai and Hangzhou, which are home to numerous technology companies and data centers. North China, with Beijing as its central hub, also plays a significant role in the market due to the concentration of governmental bodies and financial institutions that demand secure and compliant database solutions. South China, particularly Shenzhen, is another critical region, given its prominence as a technology and innovation hub. Central China and other regions are gradually catching up as investments in digital infrastructure spread across the country. Overall, the regional dynamics of the Chinese Domestic Databases market present a diverse and rapidly evolving landscape.



    Type Analysis



    The Chinese Domestic Databases market comprises various types, including Relational Databases, NoSQL Databases, NewSQL Databases, and others. Relational Databases have been the cornerstone of the database industry for decades, offering structured data storage and easy retrieval through SQL queries. Despite their age, they remain highly relevant due to their robustness, reliability, and the vast ecosystems that have developed around them. In China, relational databases continue to be widely adopted across various industries, particularly in sectors like finance and government, where data accuracy and consistency are paramount.



    NoSQL Databases have gained significant traction in recent years due to their flexibility, scalability, and ability to handle unstructured data. Unlike traditional relational databases, NoSQL databases can seamlessly manage large volumes of diverse data types, making them ideal for applications in big data and real-time web applications. In China, the adoption of NoSQL databases is particularly prominent in the e-commerce and social media sectors, where the ability to scale out horizontally and handle high-velocity data is crucial.



    NewSQL Databases represent a hybrid approach that combines the best features of traditional relational databases and NoSQL databases. They offer the scalability and flexibility of NoSQL while maintaining the ACID (Atomicity, Consistency, Isolation, Durability) prope

  15. T

    China GDP

    • tradingeconomics.com
    • it.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS, China GDP [Dataset]. https://tradingeconomics.com/china/gdp
    Explore at:
    csv, json, excel, xmlAvailable download formats
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 31, 1960 - Dec 31, 2024
    Area covered
    China
    Description

    The Gross Domestic Product (GDP) in China was worth 18743.80 billion US dollars in 2024, according to official data from the World Bank. The GDP value of China represents 17.65 percent of the world economy. This dataset provides - China GDP - actual values, historical data, forecast, chart, statistics, economic calendar and news.

  16. Music Data Sharing Platform for Computational Musicology Research (CCMUSIC...

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Nov 15, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhaorui Liu; Zijin Li; Zhaorui Liu; Zijin Li (2021). Music Data Sharing Platform for Computational Musicology Research (CCMUSIC DATASET) [Dataset]. http://doi.org/10.5281/zenodo.5676893
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 15, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Zhaorui Liu; Zijin Li; Zhaorui Liu; Zijin Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This platform is a multi-functional music data sharing platform for Computational Musicology research. It contains many music datas such as the sound information of Chinese traditional musical instruments and the labeling information of Chinese pop music, which is available for free use by computational musicology researchers.

    This platform is also a large-scale music data sharing platform specially used for Computational Musicology research in China, including 3 music databases: Chinese Traditional Instrument Sound Database (CTIS), Midi-wav Bi-directional Database of Pop Music and Multi-functional Music Database for MIR Research (CCMusic). All 3 databases are available for free use by computational musicology researchers. For the contents contained in the database, we will provide audio files recorded by the professional team of the conservatory of music, as well as corresponding labelled files, which have no commodity copyright problem and facilitate large-scale promotion. We hope that this music data sharing platform can meet the one-stop data needs of users and contribute to the research in the field of Computational Musicology.

    If you want to know more information or obtain complete files, please go to the official website of this platform:

    Music Data Sharing Platform for Academic Research

    • Chinese Traditional Instrument Sound Database (CTIS)

    This database is developed by Prof. Han Baoqiang's team for many years, which collects sound information about Chinese traditional musical instruments. The database includes 287 Chinese national musical instruments, including traditional musical instruments, improved musical instruments and ethnic minority musical instruments.

    • Multi-functional Music Database for MIR Research

    This database collects sound materials of pop music, folk music and hundreds of national musical instruments, and makes comprehensive annotation to form a multi-purpose music database for MIR researchers.

    • Midi-wav Bi-directional Database of Pop Music

    This database contains hundreds of Chinese pop songs, and each song contains the corresponding midi-audio-lyric information. Among them, recording the vocal part and accompaniment part of audio independently is helpful to study the MIR task under the ideal situation. In addition, the information of singing techniques consistent with vocal part (such as breath sound, falsetto, breathing, vibrato, mute, slide, etc.) is marked in MuseScore, which constitutes a Midi-Wav bi-direction corresponding pop music database.

  17. T

    China Industrial Production

    • tradingeconomics.com
    • es.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Jun 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). China Industrial Production [Dataset]. https://tradingeconomics.com/china/industrial-production
    Explore at:
    xml, csv, json, excelAvailable download formats
    Dataset updated
    Jun 16, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 31, 1990 - May 31, 2025
    Area covered
    China
    Description

    Industrial Production in China increased 5.80 percent in May of 2025 over the same month in the previous year. This dataset provides - China Industrial Production - actual values, historical data, forecast, chart, statistics, economic calendar and news.

  18. d

    Company datasets of 31M companies in China

    • datarade.ai
    Updated Apr 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BoldData (2021). Company datasets of 31M companies in China [Dataset]. https://datarade.ai/data-products/list-of-31m-companies-in-china-bolddata
    Explore at:
    .json, .csv, .xls, .txtAvailable download formats
    Dataset updated
    Apr 17, 2021
    Dataset authored and provided by
    BoldData
    Area covered
    China
    Description

    With over 30 million businesses, all held in-house, BoldData has the largest supply of Chinese data. We can select your perfect target based on numerous interesting selections: from 3,000 industries to region, turnover, sector, contact person and the number of employees.

    Other questions or are you looking for another city or country? Our data experts are specialized in supervising international campaigns. We have specific direct marketing knowledge per country and have highly accurate data of 300 million companies in 150+ countries. Contact us for free tailor-made advice and an independent quote.

  19. m

    The Motivations for Fashion Shopping in China (SPSS Dataset)

    • data.mendeley.com
    Updated Jul 2, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christopher J. Parker (2018). The Motivations for Fashion Shopping in China (SPSS Dataset) [Dataset]. http://doi.org/10.17632/bzn593sv5d.1
    Explore at:
    Dataset updated
    Jul 2, 2018
    Authors
    Christopher J. Parker
    License

    Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
    License information was derived automatically

    Area covered
    China
    Description

    In this study, 403 Chinese consumers generalizable to the broader population were surveyed on their motivations to shop for fashion apparel in both high street and e-commerce environments. Statistical analysis was undertaken through multiple T-Tests and MANOVA with the assistance of SPSS and G*Power.

    To increase the profits of international brands, this paper presents the motivations of Chinese consumers to engage in fashion retail, building upon established theory in hedonic and utilitarian motivations. With China set to capture over 24% of the $212 billion fashion market, international brands need to understand the unique motivations of Chinese consumers in order to capitalise on the market. However, the motivations of Chinese people to engage in fashion retail are as yet undefined, limiting the ability for international fashion retailers to operate with prosperity in the Chinese market.

  20. The 30 m annual land cover datasets and its dynamics in China from 1985 to...

    • zenodo.org
    bin, jpeg, tiff, zip
    Updated Aug 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jie Yang; Xin Huang; Jie Yang; Xin Huang (2024). The 30 m annual land cover datasets and its dynamics in China from 1985 to 2023 [Dataset]. http://doi.org/10.5281/zenodo.12779975
    Explore at:
    tiff, bin, zip, jpegAvailable download formats
    Dataset updated
    Aug 7, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jie Yang; Xin Huang; Jie Yang; Xin Huang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Using 335,709 Landsat images on the Google Earth Engine, we built the first Landsat-derived annual land cover product of China (CLCD) from 1985 to 2019. We collected the training samples by combining stable samples extracted from China's Land-Use/Cover Datasets (CLUD), and visually-interpreted samples from satellite time-series data, Google Earth and Google Map. Several temporal metrics were constructed via all available Landsat data and fed to the random forest classifier to obtain classification results. A post-processing method incorporating spatial-temporal filtering and logical reasoning was further proposed to improve the spatial-temporal consistency of CLCD.

    "*_albert.tif" are projected files via a proj4 string "+proj=aea +lat_1=25 +lat_2=47 +lat_0=0 +lon_0=105 +x_0=0 +y_0=0 +datum=WGS84 +units=m +no_defs".

    CLCD in 2023 is now available.

    1. Given that the USGS no longer maintains the Landsat Collection 1 data, we are now using the Collection 2 SR data to update the CLCD.

    2. All files in this version have been exported as Cloud Optimized GeoTIFF for more efficient processing on the cloud. Please check here for more details.

    3. Internal overviews and color tables are built into each file to speed up software loading and rendering.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Christian Henriot (2025). Modern China Geospatial Database - Main Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5735393

Modern China Geospatial Database - Main Dataset

Explore at:
Dataset updated
Feb 28, 2025
Dataset authored and provided by
Christian Henriot
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered
China
Description

MCGD_Data_V2.2 contains all the data that we have collected on locations in modern China, plus a number of locations outside of China that we encounter frequently in historical sources on China. All further updates will appear under the name "MCGD_Data" with a time stamp (e.g., MCGD_Data2023-06-21)

You can also have access to this dataset and all the datasets that the ENP-China makes available on GitLab: https://gitlab.com/enpchina/IndexesEnp

Altogether there are 464,970 entries. The data include the name of locations and their variants in Chinese, pinyin, and any recorded transliteration; the name of the province in Chinese and in pinyin; Province ID; the latitude and longitude; the Name ID and Location ID, and NameID_Legacy. The Name IDs all start with H followed by seven digits. This is the internal ID system of MCGD (the NameID_Legacy column records the Name IDs in their original format depending on the source). Locations IDs that start with "DH" are data points extracted from China Historical GIS (Harvard University); those that start with "D" are locations extracted from the data points in Geonames; those that have only digits (8 digits) are data points we have added from various map sources.

One of the main features of the MCGD Main Dataset is the systematic collection and compilation of place names from non-Chinese language historical sources. Locations were designated in transliteration systems that are hardly comprehensible today, which makes it very difficult to find the actual locations they correspond to. This dataset allows for the conversion from these obsolete transliterations to the current names and geocoordinates.

From June 2021 onward, we have adopted a different file naming system to keep track of versions. From MCGD_Data_V1 we have moved to MCGD_Data_V2. In June 2022, we introduced time stamps, which result in the following naming convention: MCGD_Data_YYYY.MM.DD.

UPDATES

MCGD_Data2025_02_28 includes a major change with the duplication of all the locations listed under Beijing, Shanghai, Tianjin, and Chongqing (北京, 上海, 天津, 重慶) and their listing under the name of the provinces to which they belonge origially before the creation of the four special municipalities after 1949. This is meant to facilitate the matching of data from historical sources. Each location has a unique NameID. Altogether there are 472,818 entries

MCGD_Data2025_02_27 inclues an update on locations extracted from Minguo zhengfu ge yuanhui keyuan yishang zhiyuanlu 國民政府各院部會科員以上職員錄 (Directory of staff members and above in the ministries and committees of the National Government). Nanjing: Guomin zhengfu wenguanchu yinzhuju 國民政府文官處印鑄局國民政府文官處印鑄局, 1944). We also made corrections in the Prov_Py and Prov_Zh columns as there were some misalignments between the pinyin name and the name in Chines characters. The file now includes 465,128 entries.

MCGD_Data2024_03_23 includes an update on locations in Taiwan from the Asia Directories. Altogether there are 465,603 entries (of which 187 place names without geocoordinates, labelled in the Lat Long columns as "Unknown").

MCGD_Data2023.12.22 contains all the data that we have collected on locations in China, whatever the period. Altogether there are 465,603 entries (of which 187 place names without geocoordinates, labelled in the Lat Long columns as "Unknown"). The dataset also includes locations outside of China for the purpose of matching such locations to the place names extracted from historical sources. For example, one may need to locate individuals born outside of China. Rather than maintaining two separate files, we made the decision to incorporate all the place names found in historical sources in the gazetteer. Such place names can easily be removed by selecting all the entries where the 'Province' data is missing.

Search
Clear search
Close search
Google apps
Main menu