Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Scott M. Brown (University of Puerto Rico)
Email: scott.brown@upr.edu
Data DOI: 10.5281/zenodo.15050209
This project empirically tests how language regimes embedded in legal and administrative systems create institutional traps that constrain multinational enterprise (MNE) operations and economic integration.
The study combines national and subnational data across four key datasets to measure how symbolic misalignment (such as monolingualism in non-commercial languages) affects regulatory quality, business formation, and workforce access.
You must upload the following four files into your Google Colab session before running the code:
Uploaded File | Description |
---|---|
/content/2020_Rankings.xlsx | World Bank Ease of Doing Business (EODB) — Global regulatory efficiency indicators (2020 Edition) |
/content/DBNA 2022 Rank and Scores.xlsx | Doing Business North America (DBNA 2022) — City-level institutional performance across 83 U.S. cities |
/content/Spanish_Speakers_All_States.xlsx | U.S. Census American Community Survey (ACS) — State-level Spanish-speaking and English proficiency data |
/content/wgidataset.xlsx | World Governance Indicators (WGI) — Governance quality measures (Regulatory Quality, Government Effectiveness, etc.) |
Open Google Colab.
Upload the four Excel files listed above.
Copy and paste the Python code provided below into a Colab notebook cell.
Run the code to automatically load the datasets, clean the data, and estimate key regression models.
Symbolic Institutional Traps: Language regimes act as hidden barriers, complicating regulatory navigation and labor market integration.
Symbolic Misalignment: Misfit between administrative languages and global commercial norms raises onboarding costs for MNEs.
Institutional Friction: Language encapsulation isolates economies and reduces foreign direct investment (FDI) attractiveness.
Each dataset has been:
Cleaned for consistent formatting.
Harmonized for cross-dataset integration.
Standardized to facilitate reproducible econometric analysis.
Full codebooks and metadata are available in the appendix of the research paper.
The EF EPI (English Proficiency) dataset was not uploaded here. If available, further regressions on symbolic distance can be run.
If any columns do not match exactly (e.g., different spellings), modify the variable names slightly based on print(dbna.columns)
.
The code generates:
Regression outputs on how Spanish-speaking prevalence correlates with:
Starting a business
Ease of Doing Business
Regulatory quality
Subnational institutional performance differences (Puerto Rico vs. U.S. states).
Open Data: CC BY 4.0 License
Citation Requested:
Brown, S.M. (2025). Symbolic Institutional Traps and the Liability of Foreignness: Language Regimes as Hidden Barriers to Multinational Entry. University of Puerto Rico. DOI: 10.5281/zenodo.15050209
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Scott M. Brown (University of Puerto Rico)
Email: scott.brown@upr.edu
Data DOI: 10.5281/zenodo.15050209
This project empirically tests how language regimes embedded in legal and administrative systems create institutional traps that constrain multinational enterprise (MNE) operations and economic integration.
The study combines national and subnational data across four key datasets to measure how symbolic misalignment (such as monolingualism in non-commercial languages) affects regulatory quality, business formation, and workforce access.
You must upload the following four files into your Google Colab session before running the code:
Uploaded File | Description |
---|---|
/content/2020_Rankings.xlsx | World Bank Ease of Doing Business (EODB) — Global regulatory efficiency indicators (2020 Edition) |
/content/DBNA 2022 Rank and Scores.xlsx | Doing Business North America (DBNA 2022) — City-level institutional performance across 83 U.S. cities |
/content/Spanish_Speakers_All_States.xlsx | U.S. Census American Community Survey (ACS) — State-level Spanish-speaking and English proficiency data |
/content/wgidataset.xlsx | World Governance Indicators (WGI) — Governance quality measures (Regulatory Quality, Government Effectiveness, etc.) |
Open Google Colab.
Upload the four Excel files listed above.
Copy and paste the Python code provided below into a Colab notebook cell.
Run the code to automatically load the datasets, clean the data, and estimate key regression models.
Symbolic Institutional Traps: Language regimes act as hidden barriers, complicating regulatory navigation and labor market integration.
Symbolic Misalignment: Misfit between administrative languages and global commercial norms raises onboarding costs for MNEs.
Institutional Friction: Language encapsulation isolates economies and reduces foreign direct investment (FDI) attractiveness.
Each dataset has been:
Cleaned for consistent formatting.
Harmonized for cross-dataset integration.
Standardized to facilitate reproducible econometric analysis.
Full codebooks and metadata are available in the appendix of the research paper.
The EF EPI (English Proficiency) dataset was not uploaded here. If available, further regressions on symbolic distance can be run.
If any columns do not match exactly (e.g., different spellings), modify the variable names slightly based on print(dbna.columns)
.
The code generates:
Regression outputs on how Spanish-speaking prevalence correlates with:
Starting a business
Ease of Doing Business
Regulatory quality
Subnational institutional performance differences (Puerto Rico vs. U.S. states).
Open Data: CC BY 4.0 License
Citation Requested:
Brown, S.M. (2025). Symbolic Institutional Traps and the Liability of Foreignness: Language Regimes as Hidden Barriers to Multinational Entry. University of Puerto Rico. DOI: 10.5281/zenodo.15050209